Standard supervised classification methods make the assumption that the training data is fully annotated thus requiring an a-priory labelling process which is both costly and time-consuming. To relax this requirement, many different flavors of weakly supervised learning have been proposed. Among weakly supervised learning strategies, Positive Unlabelled learning (PUL) is gaining attention from the research community due to the wide spectrum of applications it can fit. However, the majority of research studies related to PUL only consider binary classification tasks while real-world applications commonly involve multiple categories. To deal with this limitation, Multi-Positive Unlabelled learning (MPUL) has been recently introduced to learn from examples labelled with multiple positive labels and a single unknown negative label. Up to today, only a limited number of research works were proposed to cope with this more general setting. In this paper, we propose a new MPUL framework based on deep learning strategies. Our framework, named ProtoMPUL (Prototype based Multi-Positive and Unlabelled Learning), combines metric learning and clustering strategies to model the set of positive classes as well as to characterize the unknown negative one. Experimental evaluations on real-world benchmarks considering recent MPUL com- petitors demonstrates that the proposed framework achieves state-of-the-art performances, thus supporting the validity of the proposed approach.

Dealing With Multipositive Unlabeled Learning Combining Metric Learning and Deep Clustering

Esposito R.;Ienco D.
2022-01-01

Abstract

Standard supervised classification methods make the assumption that the training data is fully annotated thus requiring an a-priory labelling process which is both costly and time-consuming. To relax this requirement, many different flavors of weakly supervised learning have been proposed. Among weakly supervised learning strategies, Positive Unlabelled learning (PUL) is gaining attention from the research community due to the wide spectrum of applications it can fit. However, the majority of research studies related to PUL only consider binary classification tasks while real-world applications commonly involve multiple categories. To deal with this limitation, Multi-Positive Unlabelled learning (MPUL) has been recently introduced to learn from examples labelled with multiple positive labels and a single unknown negative label. Up to today, only a limited number of research works were proposed to cope with this more general setting. In this paper, we propose a new MPUL framework based on deep learning strategies. Our framework, named ProtoMPUL (Prototype based Multi-Positive and Unlabelled Learning), combines metric learning and clustering strategies to model the set of positive classes as well as to characterize the unknown negative one. Experimental evaluations on real-world benchmarks considering recent MPUL com- petitors demonstrates that the proposed framework achieves state-of-the-art performances, thus supporting the validity of the proposed approach.
2022
10
51839
51849
https://ieeexplore.ieee.org/document/9773176
Multi-positive unlabelled learning, weakly supervised learning, tabular data, metric learning, deep clustering
Racanati A.; Esposito R.; Ienco D.
File in questo prodotto:
File Dimensione Formato  
Dealing_With_Multipositive_Unlabeled_Learning_Combining_Metric_Learning_and_Deep_Clustering-2.pdf

Accesso aperto

Dimensione 3.28 MB
Formato Adobe PDF
3.28 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1870822
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact