CINECA IRIS Institutional Research Information System

Deep neural networks (DNNs) are becoming the core components of many applications running on edge devices, especially for real time image-based analysis. Increasingly, multi-faced knowledge is extracted via executing multiple DNNs inference models, e.g., identifying objects, faces, and genders from images. The response times of multi-DNN highly affect users’ quality of experience and safety as well. Different DNNs exhibit diversified resource requirements and execution patterns across layers and networks, which may easily exceed the available device memory and riskily degrade the responsiveness. In this paper, we design and implement Masa, a responsive memory-aware multi-DNN execution framework, an on-device middleware featuring on modeling inter- and intra-network dependency and leveraging complimentary memory usage of each layer. Masa can consistently ensure the average response time when deterministically and stochastically executing multiple DNN-based image analyses. We extensively evaluate Masa on three configurations of Raspberry Pi and a large set of popular DNN models triggered by different generation patterns of images. Our evaluation results show that Masa can achieve lower average response times by up to 90% on devices with small memory, i.e., 512 MB to 1 GB, compared to the state of the art multi-DNN scheduling solutions.

Masa: Responsive Multi-DNN Inference on the Edge

Cox, Bart;Galjaard, Jeroen;Ghiassi, Amirmasoud;Birke, Robert;Chen, Lydia Y.

2021-01-01

Abstract

Deep neural networks (DNNs) are becoming the core components of many applications running on edge devices, especially for real time image-based analysis. Increasingly, multi-faced knowledge is extracted via executing multiple DNNs inference models, e.g., identifying objects, faces, and genders from images. The response times of multi-DNN highly affect users’ quality of experience and safety as well. Different DNNs exhibit diversified resource requirements and execution patterns across layers and networks, which may easily exceed the available device memory and riskily degrade the responsiveness. In this paper, we design and implement Masa, a responsive memory-aware multi-DNN execution framework, an on-device middleware featuring on modeling inter- and intra-network dependency and leveraging complimentary memory usage of each layer. Masa can consistently ensure the average response time when deterministically and stochastically executing multiple DNN-based image analyses. We extensively evaluate Masa on three configurations of Raspberry Pi and a large set of popular DNN models triggered by different generation patterns of images. Our evaluation results show that Masa can achieve lower average response times by up to 90% on devices with small memory, i.e., 512 MB to 1 GB, compared to the state of the art multi-DNN scheduling solutions.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Titolo dell'evento
	
				IEEE International Conference on Pervasive Computing and Communications
			
	Luogo dell'evento
	
				Kassel, Germany
			
	Data dell'evento
	
				22-26 March 2021
			
	Titolo del volume
	
				IEEE International Conference on Pervasive Computing and Communications
			
	Nome editore
	
				IEEE
			
	Pagine (da)
	
				1
			
	Pagine (a)
	
				10
			
	Codice ISBN
	
				978-1-6654-0418-1
			
	DOI
	
				https://dx.doi.org/10.1109/PERCOM50583.2021.9439111
			
	Parole Chiave
	
				Pervasive computing, Schedules, Image analysis, Processor scheduling, Image edge detection, Real-time systems, Safety
			
	Tutti gli autori
	
						Cox, Bart; Galjaard, Jeroen; Ghiassi, Amirmasoud; Birke, Robert; Chen, Lydia Y.
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

File	Dimensione	Formato
2021 PerCom Masa_Responsive_Multi-DNN_Inference_on_the_Edge.pdf Accesso riservato Tipo di file: PDF EDITORIALE Dimensione 1.08 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.08 MB	Adobe PDF	Visualizza/Apri Richiedi una copia
PERCOM_2021___Masa.pdf Accesso aperto Tipo di file: POSTPRINT (VERSIONE FINALE DELL’AUTORE) Dimensione 696.14 kB Formato Adobe PDF Visualizza/Apri	696.14 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1891100

Citazioni

ND

25

22

social impact