The solver module of the Astrometric Verification Unit Global Sphere Reconstruction (AVU GSR) pipeline aims to find the astrometric parameters of ∼108 stars in the Milky Way, the attitude and instrumental settings of the Gaia satellite, and the parametrized post Newtonian parameter γ with a resolution of 10-100 micro-arcseconds. To perform this task, the code, which runs in production on Leonardo CINECA infrastructure, solves a system of linear equations with the iterative LSQR algorithm, where the coefficient matrix is large (10-50 TB) and sparse and the iterations stop when least square convergence is reached. The solver was ported to GPU with CUDA, obtaining a ∼14x acceleration factor over an original version CPU-parallelized with OpenMP. This work concentrates on a code section dedicated to covariances calculation, representing an important scientific task for Gaia mission, since the problems unknowns present strong correlations. Given the number of unknowns at mission end, the variances-covariances matrix is expected to occupy ∼1 EB, which represents a substantial "Big Data"issue. To compute a subset of the total covariances, we defined an I/Obased pipeline made of two jobs. The first job, the LSQR, writes the files every itnCovCP iterations, and the second job reads them and calculates the corresponding covariances. The two jobs can be launched either in sequence or concurrently. Previous studies demonstrated that the covariances calculation does not significantly slowdown the AVU-GSR production up to ∼3 107 covariances. Here we investigate the performance of the covariances pipeline as a function of itnCovCP. The results show that writing smaller files more frequently or writing larger files less frequently does not affect the global performance of the solver, whose speed only depends on the number of covariances to calculate and of system unknowns.

Covariances computation in the Gaia AVU-GSR Parallel Solver with I/O techniques: a performance study as a function of writing cycle length

Cesare V.;Lattanzi M. G.;Aldinucci M.;Bucciarelli B.
2025-01-01

Abstract

The solver module of the Astrometric Verification Unit Global Sphere Reconstruction (AVU GSR) pipeline aims to find the astrometric parameters of ∼108 stars in the Milky Way, the attitude and instrumental settings of the Gaia satellite, and the parametrized post Newtonian parameter γ with a resolution of 10-100 micro-arcseconds. To perform this task, the code, which runs in production on Leonardo CINECA infrastructure, solves a system of linear equations with the iterative LSQR algorithm, where the coefficient matrix is large (10-50 TB) and sparse and the iterations stop when least square convergence is reached. The solver was ported to GPU with CUDA, obtaining a ∼14x acceleration factor over an original version CPU-parallelized with OpenMP. This work concentrates on a code section dedicated to covariances calculation, representing an important scientific task for Gaia mission, since the problems unknowns present strong correlations. Given the number of unknowns at mission end, the variances-covariances matrix is expected to occupy ∼1 EB, which represents a substantial "Big Data"issue. To compute a subset of the total covariances, we defined an I/Obased pipeline made of two jobs. The first job, the LSQR, writes the files every itnCovCP iterations, and the second job reads them and calculates the corresponding covariances. The two jobs can be launched either in sequence or concurrently. Previous studies demonstrated that the covariances calculation does not significantly slowdown the AVU-GSR production up to ∼3 107 covariances. Here we investigate the performance of the covariances pipeline as a function of itnCovCP. The results show that writing smaller files more frequently or writing larger files less frequently does not affect the global performance of the solver, whose speed only depends on the number of covariances to calculate and of system unknowns.
2025
33rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2025
Torino, Italy
2025
Proceedings - 33rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2025
Institute of Electrical and Electronics Engineers Inc.
404
411
Covariances computation; High Performance Computing; I/O
Cesare V.; Becciani U.; Vecchiato A.; Lattanzi M.G.; Aldinucci M.; Bucciarelli B.
File in questo prodotto:
File Dimensione Formato  
Gaia_AVU_GSR_PDP_2025_ACOHPC_Special_Session_Camera_Ready.pdf

Accesso aperto

Descrizione: postprint
Tipo di file: POSTPRINT (VERSIONE FINALE DELL’AUTORE)
Dimensione 443.8 kB
Formato Adobe PDF
443.8 kB Adobe PDF Visualizza/Apri
249300a404.pdf

Accesso riservato

Descrizione: editorial
Tipo di file: PDF EDITORIALE
Dimensione 412.34 kB
Formato Adobe PDF
412.34 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/2076143
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact