CINECA IRIS Institutional Research Information System

In the last decade the use of artificial neural networks (ANNs) in many fields like image processing or speech recognition has become a common practice because of their effectiveness to solve complex tasks. However, in such a rush, very little attention has been paid to security aspects. In this work we explore the possibility to embed a watermark into the ANN parameters. We exploit model redundancy and adaptation capacity to lock a subset of its parameters to carry the watermark sequence. The watermark can be extracted in a simple way to claim copyright on models but can be very easily attacked with model fine-tuning. To tackle this culprit we devise a novel watermark aware training strategy. We aim at delving into the loss landscape to find an optimal configuration of the parameters such that we are robust to fine-tuning attacks towards the watermarked parameters. Our experimental results on classical ANN models trained on well-known MNIST and CIFAR-10 datasets show that the proposed approach makes the embedded watermark robust to fine-tuning and compression attacks.

Delving in the loss landscape to embed robust watermarks into neural networks

Tartaglione E.;Grangetto M.;Cavagnino D.;Botta M.

2020-01-01

Abstract

In the last decade the use of artificial neural networks (ANNs) in many fields like image processing or speech recognition has become a common practice because of their effectiveness to solve complex tasks. However, in such a rush, very little attention has been paid to security aspects. In this work we explore the possibility to embed a watermark into the ANN parameters. We exploit model redundancy and adaptation capacity to lock a subset of its parameters to carry the watermark sequence. The watermark can be extracted in a simple way to claim copyright on models but can be very easily attacked with model fine-tuning. To tackle this culprit we devise a novel watermark aware training strategy. We aim at delving into the loss landscape to find an optimal configuration of the parameters such that we are robust to fine-tuning attacks towards the watermarked parameters. Our experimental results on classical ANN models trained on well-known MNIST and CIFAR-10 datasets show that the proposed approach makes the embedded watermark robust to fine-tuning and compression attacks.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2020
			
	Titolo dell'evento
	
				International Conference on Pattern Recognition
			
	Luogo dell'evento
	
				ita
			
	Data dell'evento
	
				2021
			
	Titolo del volume
	
				Proceedings - International Conference on Pattern Recognition
			
	Nome editore
	
				Institute of Electrical and Electronics Engineers Inc.
			
	Pagine (da)
	
				10666
			
	Pagine (a)
	
				10674
			
	Codice ISBN
	
				978-1-7281-8808-9
			
	DOI
	
				https://dx.doi.org/10.1109/ICPR48806.2021.9413062
			
	Tutti gli autori
	
						Tartaglione E.; Grangetto M.; Cavagnino D.; Botta M.
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

File	Dimensione	Formato
output-5.pdf Accesso aperto Tipo di file: POSTPRINT (VERSIONE FINALE DELL’AUTORE) Dimensione 514.23 kB Formato Adobe PDF Visualizza/Apri	514.23 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1891440

Citazioni

ND

12

15

social impact