Knowledge Distillation for a Domain-Adaptive Visual Recommender System

Abluton A.
2024-01-01

Abstract

In the last few years, large-scale foundational models have shown remarkable performance in computer vision tasks. However, deploying such models in a production environment is a significant challenge because of their computational requirements. Furthermore, these models typically produce generic results and often need some form of external input. Knowledge distillation offers a promising solution to these problems: in the teacher-student framework, a smaller "student" model learns to mimic a larger "teacher" model. In this paper, we focus on the challenges of applying such techniques to augment an object detection dataset used in a commercial Visual Recommender System, which must detect items on various e-commerce websites spanning a wide range of product categories. We also present a simple solution to the problems we identified and propose a possible direction for future work.
Publication year: 2024
Conference: 2023 International Conference of the Italian Association for Artificial Intelligence Doctoral Consortium, AIxIA-DC 2023
Conference location: ita
Conference year: 2023
Series: CEUR Workshop Proceedings
Publisher: CEUR-WS
Volume: 3670
Pages: 1-6
URL: https://ceur-ws.org/Vol-3670/paper91.pdf
Keywords: Computer Vision; Knowledge Distillation; Object Detection; Visual Search
Author: Abluton A.
Use this identifier to cite or link to this document: https://hdl.handle.net/2318/2071770