Pruning Artificial Neural Networks: A Way to Find Well-Generalizing, High-Entropy Sharp Minima

Tartaglione E.; Bragagnolo A.; Grangetto M.
2020-01-01

Abstract

Recently, a race towards the simplification of deep networks has begun, showing that it is effectively possible to reduce the size of these models with minimal or no performance loss. However, there is a general lack of understanding of why these pruning strategies are effective. In this work, we compare and analyze pruned solutions obtained with two different pruning approaches, one-shot and gradual, and show that the latter is more effective. In particular, we find that gradual pruning allows access to narrow, well-generalizing minima that are typically missed by one-shot approaches. We also propose PSP-entropy, a measure of how strongly a given neuron correlates with specific learned classes. Interestingly, we observe that the features extracted by iteratively-pruned models are less correlated with specific classes, potentially making these models a better fit for transfer learning.
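
To make the comparison concrete, below is a minimal PyTorch sketch of the two magnitude-pruning regimes discussed in the abstract: one-shot pruning (a single large pruning step) versus gradual pruning (small steps interleaved with fine-tuning). This is an illustration under assumed settings, not the authors' exact procedure; the model, sparsity schedule, and function names are hypothetical.

```python
import torch
import torch.nn as nn

def magnitude_mask(weight: torch.Tensor, sparsity: float) -> torch.Tensor:
    """Boolean mask keeping the (1 - sparsity) fraction of largest-magnitude weights."""
    k = int(weight.numel() * sparsity)
    if k == 0:
        return torch.ones_like(weight, dtype=torch.bool)
    threshold = weight.abs().flatten().kthvalue(k).values
    return weight.abs() > threshold

def prune_to(model: nn.Module, sparsity: float) -> None:
    """Zero out the smallest-magnitude weights of every Linear layer."""
    for module in model.modules():
        if isinstance(module, nn.Linear):
            module.weight.data *= magnitude_mask(module.weight.data, sparsity)

# Hypothetical MLP, e.g. for MNIST-sized inputs.
model = nn.Sequential(nn.Linear(784, 300), nn.ReLU(), nn.Linear(300, 10))

# One-shot: jump directly to the target sparsity in a single step.
prune_to(model, sparsity=0.9)

# Gradual: approach the same target through small pruning steps,
# fine-tuning in between so the optimizer can recover after each step
# (the regime in which the paper finds narrow, well-generalizing minima).
for sparsity in (0.3, 0.5, 0.7, 0.8, 0.9):
    prune_to(model, sparsity)
    # ... fine-tune the model here before the next pruning step ...
```

The abstract also introduces PSP-entropy. The sketch below computes a PSP-entropy-style quantity under one plausible reading: for each neuron, the entropy of its binary activation state (post-synaptic potential above zero) over the samples of each class. Low entropy for a class means the neuron's state is almost deterministic for that class, i.e. the neuron is strongly class-correlated. The exact formulation in the paper may differ, and all names here are assumptions.

```python
import numpy as np

def psp_entropy(psp: np.ndarray, labels: np.ndarray, num_classes: int) -> np.ndarray:
    """psp: (num_samples, num_neurons) pre-activations recorded on a dataset.
    Returns (num_classes, num_neurons) binary entropies, in bits, of each
    neuron's on/off state within each class."""
    on = psp > 0.0                              # binary post-synaptic state
    entropy = np.zeros((num_classes, on.shape[1]))
    for c in range(num_classes):
        p = on[labels == c].mean(axis=0)        # P(neuron on | class c)
        p = np.clip(p, 1e-12, 1.0 - 1e-12)      # avoid log(0)
        entropy[c] = -(p * np.log2(p) + (1.0 - p) * np.log2(1.0 - p))
    return entropy

# Toy usage with random data standing in for recorded activations:
rng = np.random.default_rng(0)
psp = rng.normal(size=(1000, 300))              # e.g. hidden-layer pre-activations
labels = rng.integers(0, 10, size=1000)
print(psp_entropy(psp, labels, num_classes=10).shape)  # (10, 300)
```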
Year: 2020
Language: English
Type: contribution
Conference type: 1 - Conference
Conference: 29th International Conference on Artificial Neural Networks, ICANN 2020
Conference country: Slovakia (SVK)
Conference year: 2020
Relevance: International
Series: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Refereeing: Anonymous referees
Publisher: Springer Science and Business Media Deutschland GmbH
Publisher location: Berlin, GERMANY
Volume: 12397
Pages: 67-78 (12 pages)
ISBN (print): 978-3-030-61615-1
ISBN (online): 978-3-030-61616-8
Keywords: Deep learning; Entropy; Post synaptic potential; Pruning; Sharp minima
Open Access: 1 - product with file in Open Access version
Document type: info:eu-repo/semantics/conferenceObject
Category: 04-CONTRIBUTION IN CONFERENCE PROCEEDINGS::04A-Conference paper in volume
Authors: Tartaglione E.; Bragagnolo A.; Grangetto M.
Access: open
Files in this product:
ICANN20.pdf - Open Access - Type: POSTPRINT (AUTHOR'S FINAL VERSION) - Size: 426.2 kB - Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or create a link to this document: https://hdl.handle.net/2318/1765267
Citations
  • PMC: ND
  • Scopus: 10
  • Web of Science (ISI): 6