CINECA IRIS Institutional Research Information System

Toward HPC application portability via C++ PSTL: the Gaia AVU-GSR code assessment

Malenza Giulio (first author); Cesare Valentina; Aldinucci Marco; Becciani Ugo; Vecchiato Alberto

2024-01-01

Abstract

The computing capacity needed to process the data generated by modern scientific experiments is approaching the ExaFLOPS scale. Currently, such performance is achievable only with GPU-accelerated supercomputers. Different languages have been developed to program GPUs at different levels of abstraction. Typically, the more abstract a language is, the more portable it is across different GPUs. Conversely, the less abstract and the more closely co-designed with the hardware a language is, the more room it leaves for code optimization and, ultimately, the higher the achievable performance. In the HPC context, portability and performance are therefore a long-standing dichotomy. The current C++ Parallel Standard Template Library (PSTL) has the potential to go beyond this dichotomy. In this work, we analyze the main performance benefits and limitations of PSTL, using as a use case the Gaia Astrometric Verification Unit-Global Sphere Reconstruction (AVU-GSR) parallel solver developed by the European Space Agency Gaia mission. The code aims to find the astrometric parameters of $$\sim10^8$$ stars in the Milky Way by iteratively solving a linear system of equations with the LSQR algorithm, and was originally ported to GPUs with CUDA. We show that the performance of the PSTL version, which is intrinsically more portable than CUDA, is comparable to that of the CUDA version on the NVIDIA GPU architecture.

Year	2024
Publication language	English
ISI WoS code	WOS:001186846200001
Scopus code	2-s2.0-85188136338
Referee	Scientific committee
Journal title	THE JOURNAL OF SUPERCOMPUTING
First page	1
Last page	22
Total number of pages	22
DOI	https://dx.doi.org/10.1007/s11227-024-06011-1
Keywords	High-performance computing, Standard parallelism, GPU programming, Astrometry
Co-authors affiliated with foreign institutions	no
Project
Project title	Third Party CINI - "EUPEX - EUROPEAN PILOT FOR EXASCALE" (H2020-JTI-EuroHPC-2020-1)
Acronym	EUPEX
Funder name	EUROPEAN COMMISSION
Funding	H2020
Contract no.	ALDINUCCI M. - H2020 RIA G.A. n. 101033975
Item compliant with the University open access regulation?	1 – item with an Open Access file version (I will attach the file at step 6 - Upload)
Type (faculty site)	262
Number of authors	5
All authors	Malenza Giulio; Cesare Valentina; Aldinucci Marco; Becciani Ugo; Vecchiato Alberto
Type	info:eu-repo/semantics/article
Fulltext	partially_open
Type	03-Journal contribution::03A-Journal article
Appears in types	03A-Journal article

Files in this item:

File	Access	File type	Size	Format
GAIAFINALDECISION.pdf	Restricted access	Publisher's PDF	1.75 MB	Adobe PDF
_JSUPE_SI23__Gaia_GPU-1.pdf	Open access	Preprint (first draft)	771.4 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/1967551
