Deep Learning at Scale
Paolo Viviani;Maurizio Drocco;Daniele Baccega;Iacopo Colonnelli;Marco Aldinucci
2019-01-01
Abstract
This work presents a novel approach to the distributed training of deep neural networks (DNNs) that aims to overcome the limitations of mainstream data-parallel training approaches. Established data-parallel training techniques are discussed from both a parallel computing and a deep learning perspective; a different approach is then presented that is intended to let DNN training scale while retaining good convergence properties. Finally, an experimental implementation is presented, along with some preliminary results.
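For context, the mainstream data-parallel scheme the abstract refers to is synchronous gradient averaging: every worker holds a full replica of the model, computes a gradient on its own data shard, and the gradients are averaged (typically via an allreduce) before a common parameter update. The snippet below is a minimal, self-contained sketch of that conventional scheme only, not of the approach proposed in the paper; the toy linear-regression model, the synthetic data, and all names are illustrative assumptions.

```python
# Minimal sketch of synchronous data-parallel SGD (gradient averaging).
# Illustration of the "established" scheme discussed in the paper,
# NOT the method the paper proposes. All details here are assumed.
import numpy as np

rng = np.random.default_rng(0)

# Synthetic regression problem: y = X @ w_true + noise
n_samples, n_features, n_workers = 1024, 8, 4
X = rng.normal(size=(n_samples, n_features))
w_true = rng.normal(size=n_features)
y = X @ w_true + 0.01 * rng.normal(size=n_samples)

# Each worker holds a disjoint shard of the data (data parallelism).
shards = list(zip(np.array_split(X, n_workers), np.array_split(y, n_workers)))

w = np.zeros(n_features)  # model parameters, replicated on every worker
lr = 0.1

for step in range(200):
    # 1. Every worker computes a local gradient on its own shard.
    local_grads = []
    for X_k, y_k in shards:
        residual = X_k @ w - y_k
        local_grads.append(X_k.T @ residual / len(y_k))
    # 2. Gradients are averaged across workers (the allreduce step).
    g = np.mean(local_grads, axis=0)
    # 3. All replicas apply the same update, so parameters stay in sync.
    w -= lr * g

print("parameter error:", np.linalg.norm(w - w_true))
```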
Files in this record:

| File | Description | Type | Access | Size | Format |
|---|---|---|---|---|---|
| 19_deeplearning_PDP.pdf | | Postprint (author's final version) | Open access | 404.27 kB | Adobe PDF |
| 19_deeplearning_PDP_editorial.pdf | Editorial PDF | Publisher's PDF | Restricted access | 173.9 kB | Adobe PDF |