A round-trip journey in pruned artificial neural networks
Bragagnolo A.; Tartaglione E.; Dalmasso G.; Grangetto M.
2023-01-01
Abstract
In the last decade, deep learning models have competed for performance at the price of tremendous computational costs. This critical aspect has recently attracted more attention for both the training and inference phases. Although the complexity of a single inference pass is orders of magnitude lower than that of training, inference is performed many times, which impacts efficiency on edge or embedded devices. Inference can be made more efficient through neural network pruning, which removes parameters and neurons from the model's topology while maintaining its accuracy, thereby reducing the resource and energy requirements of the model. This paper describes two pruning procedures that lower the number of operations required during the inference phase, together with a method to exploit the resulting sparsity. Although the same procedures cannot be applied directly at training time, we show that similar ideas can be borrowed to reduce the cost of gradient backpropagation by disabling the computation for selected neurons.
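A minimal, generic sketch of the two ideas mentioned above: magnitude-based removal of whole neurons (structured pruning) and masking the gradients of the pruned neurons during backpropagation. This is not the procedure proposed in the paper; the layer sizes, the median-norm pruning criterion, and the variable names (`layer`, `keep`, `grad_mask`) are illustrative assumptions only.

```python
# Hypothetical illustration of neuron pruning and of skipping updates for
# pruned neurons; NOT the authors' method, only the general idea.
import torch
import torch.nn as nn

layer = nn.Linear(256, 128)

# Inference-side pruning: zero out whole output neurons whose weight rows have
# a small L2 norm. Rows of zeros yield structured sparsity that downstream
# kernels can exploit (the rows could simply be dropped from the layer).
with torch.no_grad():
    neuron_norms = layer.weight.norm(p=2, dim=1)      # one norm per output neuron
    keep = neuron_norms > neuron_norms.median()       # arbitrary, hypothetical criterion
    layer.weight[~keep] = 0.0
    layer.bias[~keep] = 0.0

# Training-side idea: mask the gradients of pruned neurons so they receive no
# updates; a sparsity-aware implementation could skip the corresponding part
# of the backward computation altogether.
grad_mask = keep.float().unsqueeze(1)                  # 1 for active rows, 0 for pruned
layer.weight.register_hook(lambda g: g * grad_mask)
layer.bias.register_hook(lambda g: g * keep.float())

x = torch.randn(32, 256)
loss = layer(x).pow(2).mean()
loss.backward()                                        # pruned rows get zero gradient
```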
| File | Access | File type | Size | Format |
|---|---|---|---|---|
| output.pdf | Open access | Postprint (author's final version) | 226.56 kB | Adobe PDF |