Semi-supervised learning has shown its potential in many real-world applications where only few labeled examples are available. However, when some fairness constraints need to be satisfied, semisupervised classification models often struggle as they are required to cope with the lack of sufficient information for predicting the target variable while forgetting its relationships with any sensitive and potentially discriminatory attribute. To address this issue, we propose a fair semi-supervised representation learning architecture that leads to fair and accurate classification results even in very challenging scenarios with few labeled (but biased) instances. We show experimentally that our model can be easily adopted in very general settings, as the learned representations may be employed to train any supervised classifier. Moreover, when applied to several real-world datasets, our method is competitive with state-of-the-art fair semi-supervised approaches.

Fair Semi-supervised Representation Learning for Tabular Data Classification

Shuyi Yang
Co-first
;
Mattia Cerrato
Co-first
;
Dino Ienco;Ruggero G. Pensa
Co-last
;
Roberto Esposito
Co-last
2023-01-01

Abstract

Semi-supervised learning has shown its potential in many real-world applications where only few labeled examples are available. However, when some fairness constraints need to be satisfied, semisupervised classification models often struggle as they are required to cope with the lack of sufficient information for predicting the target variable while forgetting its relationships with any sensitive and potentially discriminatory attribute. To address this issue, we propose a fair semi-supervised representation learning architecture that leads to fair and accurate classification results even in very challenging scenarios with few labeled (but biased) instances. We show experimentally that our model can be easily adopted in very general settings, as the learned representations may be employed to train any supervised classifier. Moreover, when applied to several real-world datasets, our method is competitive with state-of-the-art fair semi-supervised approaches.
2023
31st Symposium of Advanced Database Systems (SEBD 2023)
Galzignano Terme, Italy
July 2-5, 2023
Proceedings of the 31st Symposium of Advanced Database Systems (SEBD 2023)
CEUR-WS.org
3478
488
496
https://ceur-ws.org/Vol-3478/paper51.pdf
semi-supervised autoencoder, fairness, deep neural networks
Shuyi Yang, Mattia Cerrato, Dino Ienco, Ruggero G. Pensa, Roberto Esposito
File in questo prodotto:
File Dimensione Formato  
sebd2023_online.pdf

Accesso aperto

Descrizione: PDF online
Tipo di file: PDF EDITORIALE
Dimensione 1.03 MB
Formato Adobe PDF
1.03 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1931555
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact