
HPC4AI@UNITO: A Use Case For Datacenter Digital Twin

Robert Birke;Lavinia Chiara Tagliabue;Sergio Rabellino;Marco Aldinucci
2025-01-01

Abstract

The HPC4AI datacenter, hosted at the Computer Science Department of the University of Turin, was established to address the exponentially increasing computing needs of cross-disciplinary research on AI. To meet the needs of modern AI, HPC4AI rethinks the traditional usage of cloud and HPC systems: the cloud provides a modern interface for HPC, and HPC serves as an accelerator for the cloud. To date, it has supported 40+ research projects across a broad range of domains, from astronomy and medicine to human sciences. It also acts as an R&D platform to study, develop, and test new datacenter technologies. As such, it hosts a zoo of exotic computing platforms and the first prototype of two-phase evaporative server cooling. This work describes the operation and management of HPC4AI, the challenges encountered and lessons learned, and analyzes key opportunities for digital twins.
Year: 2025
Conference: Euromicro International Conference on Parallel, Distributed and Network Based Processing
Location: Torino
Dates: 12-14 Mar 2025
Publisher: IEEE
Pages: 499-505
ISBN: 979-8-3315-2493-7; 979-8-3315-2494-4
Viviana Vaccaro, Robert Birke, Lavinia Chiara Tagliabue, Sergio Rabellino, Marco Aldinucci
Files in this record:
249300a499.pdf: restricted access; publisher's version (PDF); 818.22 kB; Adobe PDF
PDP_DT4DC_HPC4AI.pdf: open access; postprint (author's final version); 1.56 MB; Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/2070516
Citations
  • Scopus: 1