Take a Ramble into Solution Spaces for Classification Problems in Neural Networks

Tartaglione, Enzo; Grangetto, Marco
2019-01-01

Abstract

Solving a classification problem with a neural network means looking for a particular configuration of its internal parameters. This is commonly achieved by minimizing non-convex objective functions. Hence, the same classification problem is likely to have several different, equally valid solutions, depending on factors like the initialization and the chosen optimizer. In this work, we propose an algorithm that looks for a zero-error path joining two solutions to the same classification problem. We observe that finding such a path is typically not a trivial problem; nevertheless, our heuristic succeeds at the task. This is a step towards explaining why simple training heuristics (like SGD) are able to train complex neural networks: we speculate that they focus on particular solutions, which belong to a connected solution sub-space. We work in two different scenarios: a synthetic, unbiased and totally uncorrelated (hard) training problem, and MNIST. We empirically show that the algorithmically accessible solution space is connected, and we have hints suggesting it is a convex sub-space. © 2019, Springer Nature Switzerland AG.
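This record does not spell out the path-finding heuristic itself. As a rough, non-authoritative sketch of the general idea, the Python/NumPy snippet below (an assumption throughout: the language choice and all names, such as find_zero_error_path and error_fn, are illustrative, not the authors' implementation) interpolates linearly between two solutions and randomly perturbs any intermediate point that leaves the zero-error region:

import numpy as np

def find_zero_error_path(w_a, w_b, error_fn, n_points=20, n_tries=200, sigma=0.1, seed=0):
    # Sketch only: build a piecewise-linear path from w_a to w_b whose
    # intermediate points all achieve zero classification error.
    rng = np.random.default_rng(seed)
    ts = np.linspace(0.0, 1.0, n_points)
    path = [(1.0 - t) * w_a + t * w_b for t in ts]
    for i in range(1, n_points - 1):
        if error_fn(path[i]) == 0.0:
            continue  # this point already lies in the zero-error region
        # The straight segment leaves the zero-error region here:
        # try random bends around the offending point.
        for _ in range(n_tries):
            candidate = path[i] + sigma * rng.standard_normal(path[i].shape)
            if error_fn(candidate) == 0.0:
                path[i] = candidate
                break
        else:
            return None  # no zero-error bend found within this budget
    return path

# Toy usage on a linearly separable problem (purely illustrative):
rng = np.random.default_rng(1)
X = rng.standard_normal((100, 10))
w_true = rng.standard_normal(10)
y = np.sign(X @ w_true)
error = lambda w: float(np.mean(np.sign(X @ w) != y))
path = find_zero_error_path(2.0 * w_true, 3.0 * w_true, error)

A returned path certifies, at the sampled resolution, that the two solutions lie in the same connected zero-error region; a None result only means this particular random-bend budget failed, mirroring the abstract's observation that finding such paths is typically non-trivial.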
Year: 2019
Conference: International Conference on Image Analysis and Processing, ICIAP 2019
Location: Trento
Date: 9/9/2019
Publisher: Springer Verlag
Volume: 11751
Pages: 345-355
ISBN: 978-3-030-30641-0
ISBN: 978-3-030-30642-7
Authors: Tartaglione, Enzo; Grangetto, Marco
Files in this item:
ICIAP19_takearamble.pdf (Adobe PDF, 323.92 kB): open access, postprint (author's final version)
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/1714235
Citations
  • PMC: ND
  • Scopus: 5
  • Web of Science (ISI): 4