Learning sparse neural networks via sensitivity-driven regularization

Tartaglione E.; Fiandrotti A.; Francini G.
2018-01-01

Abstract

The ever-increasing number of parameters in deep neural networks poses challenges for memory-limited applications. Regularize-and-prune methods aim at meeting these challenges by sparsifying the network weights. In this context we quantify the output sensitivity to the parameters (i.e. their relevance to the network output) and introduce a regularization term that gradually lowers the absolute value of parameters with low sensitivity. Thus, a very large fraction of the parameters approach zero and are eventually set to zero by simple thresholding. Our method surpasses most of the recent techniques both in terms of sparsity and error rates. In some cases, the method reaches twice the sparsity obtained by other techniques at equal error rates.
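The regularize-then-threshold idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a single linear layer `y = W x`, takes sensitivity to be the mean over outputs of `|dy_i/dW[k][j]|`, and takes the shrinkage factor to be `max(0, 1 - S)`. The function names and the parameters `lam` and `tau` are hypothetical. The loss-gradient step of ordinary training is left out so that the effect of the regularizer alone is visible.

```python
def sensitivity_linear(x, n_outputs):
    # Assumption: sensitivity of W[k][j] is the mean over outputs i of |dy_i/dW[k][j]|.
    # For y = W x this derivative is x[j] when i == k and 0 otherwise,
    # so every weight in column j gets |x[j]| / n_outputs.
    return [[abs(xj) / n_outputs for xj in x] for _ in range(n_outputs)]


def decay_step(weights, sens, lam):
    # Shrink each weight in proportion to its insensitivity max(0, 1 - S):
    # weights with sensitivity >= 1 are untouched, weights with S = 0 decay fastest.
    return [
        [w * (1.0 - lam * max(0.0, 1.0 - s)) for w, s in zip(w_row, s_row)]
        for w_row, s_row in zip(weights, sens)
    ]


def threshold(weights, tau):
    # Simple thresholding: any weight with magnitude below tau is set to exactly zero.
    return [[0.0 if abs(w) < tau else w for w in row] for row in weights]


def prune_demo():
    weights = [[0.5, -0.4], [0.3, 0.8]]
    x = [2.0, 0.0]  # the second input is always zero, so column 1 never affects the output
    sens = sensitivity_linear(x, n_outputs=2)  # [[1.0, 0.0], [1.0, 0.0]]
    for _ in range(10):
        weights = decay_step(weights, sens, lam=0.5)
    return threshold(weights, tau=0.01)


print(prune_demo())  # [[0.5, 0.0], [0.3, 0.0]]
```

In this toy case the weights attached to the inactive input shrink by a factor of 0.5 per step and fall below the threshold, while the weights with sensitivity 1 are left unchanged.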
2018
32nd Conference on Neural Information Processing Systems, NeurIPS 2018
Montréal, Canada
2018
Advances in Neural Information Processing Systems
MIT Press
2018-
pp. 3878-3888
https://proceedings.neurips.cc/paper_files/paper/2018/file/04df4d434d481c5bb723be1b6df1ee65-Paper.pdf
Tartaglione E.; Fiandrotti A.; Lepsoy S.; Francini G.
Files in this item:
File: NeurIPS-2018-learning-sparse-neural-networks-via-sensitivity-driven-regularization-Paper.pdf
Access: open access
Description: open access document
File type: POSTPRINT (AUTHOR'S FINAL VERSION)
Size: 367.63 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/1965431