Collection and Analysis of Sensitive Data with Privacy Protection by a Distributed Randomized Response Protocol

Faisal Imran;Rosa Meo
2024-01-01

Abstract

Data collected from smart devices, the Internet of Things (IoT), and smart homes can be mined for insight and can benefit organizations with a large user base. Because data from personal devices is intrinsically private, it should be collected through a mechanism that guarantees privacy. Local differential privacy addresses this by collecting a randomized response from each user, so it does not rely on a trusted data aggregator or curator, and it still allows reliable prediction models to be built on the collected randomized data. The proposed approach uses the randomized response technique in a novel manner: it guarantees privacy to users during data collection while preserving high utility for the analysis. It can be seen as a case of synthetic data generation, producing contingency tables (marginals) through a privacy-preserving mechanism. This article describes the proposed randomized response technique and discusses the motivating application domains. It justifies the differential privacy and utility guarantees both theoretically and through an experimental analysis that yielded excellent results.
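
As a rough illustration of the general idea behind randomized response under local differential privacy, the sketch below implements the classic binary (Warner-style) scheme, not the distributed protocol proposed in the paper. Each user reports a true bit with probability e^ε / (1 + e^ε) and the flipped bit otherwise, and the collector corrects the observed frequency to obtain an unbiased estimate. The function names, the parameter `epsilon`, and the simulated population are all assumptions made for this example.

```python
import math
import random


def randomize_bit(true_bit: int, epsilon: float, rng: random.Random) -> int:
    """Report the true bit with probability p = e^eps / (1 + e^eps), else flip it.

    The ratio p / (1 - p) equals e^eps, which is the epsilon-LDP bound
    for a single binary answer.
    """
    p_truth = math.exp(epsilon) / (1.0 + math.exp(epsilon))
    return true_bit if rng.random() < p_truth else 1 - true_bit


def estimate_proportion(reports: list[int], epsilon: float) -> float:
    """Unbiased estimate of the true fraction of 1s from randomized reports."""
    p_truth = math.exp(epsilon) / (1.0 + math.exp(epsilon))
    observed = sum(reports) / len(reports)
    # E[observed] = pi * p + (1 - pi) * (1 - p); solving for pi gives:
    return (observed + p_truth - 1.0) / (2.0 * p_truth - 1.0)


def simulate(n_users: int = 100_000, true_fraction: float = 0.3,
             epsilon: float = 1.0, seed: int = 0) -> tuple[float, float]:
    """Hypothetical population: return (actual fraction, estimated fraction)."""
    rng = random.Random(seed)
    true_bits = [1 if rng.random() < true_fraction else 0 for _ in range(n_users)]
    # Only the randomized reports would ever leave the users' devices.
    reports = [randomize_bit(bit, epsilon, rng) for bit in true_bits]
    return sum(true_bits) / n_users, estimate_proportion(reports, epsilon)


if __name__ == "__main__":
    actual, estimated = simulate()
    print(f"actual fraction: {actual:.4f}, estimated from reports: {estimated:.4f}")
```

A smaller `epsilon` gives each user stronger deniability but makes the estimate noisier for a fixed number of users, which is the privacy/utility trade-off the abstract refers to.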
Year: 2024
Conference: ACM Symposium of Applied Computing
Location: Avila, Spain
Dates: 8-12 April 2024
Proceedings: Proceedings of the 39th ACM/SIGAPP Symposium On Applied Computing
Publisher: ACM SIGAPP
Pages: 1414-1423
ISBN: 979-8-4007-0243-3
URLs:
https://www.sigapp.org/sac/sac2024/index.php
https://www.uni-saarland.de/lehrstuhl/sorge/forschung/workshopskonferenzen/acm-sac-2024-track-on-privacy-by-design-in-practice.html
https://dl.acm.org/
Keywords: Randomized Response; Local Differential Privacy; Contingency Tables; Privacy protection; Distributed computation protocol
Files in this item:
ACM_SAC_2024__Track_on_Privacy_by_Design_in_Practice_CR.pdf (open access, 2.24 MB, Adobe PDF)

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/1947613