Detection of Privacy-Harming Social Media Posts in Italian

Peiretti, Federico; Pensa, Ruggero G.

doi:10.1007/978-981-99-5177-2_12

As many psychological and sociological study reveal, many people disclose too much privacy-harming information in social media in the form of text and multimedia posts, thus exposing themselves and other persons to several security risks. Consequently, many researchers have addressed this problem by investigating on the detection and analysis of the so-called self-disclosure behavior in social media and blogging platforms. Among the others, content sensitivity analysis has emerged as a promising research direction, but, so far, it has only focused on English text posts, although it is well-known that people tend to disclose mostly in their own native languages. Therefore, in this paper, we address this limitation by proposing a new text corpus of Italian posts that we have annotated following to the anonymity assumption. We then apply several language models based on transformers to classify them according to their sensitivity. Moreover, since Italian is a lower-resource language compared to English, we also apply some multilingual zero-shot transfer learning architectures trained on a rich and manually annotated English corpus and tested on the Italian one. We show experimentally that the approaches trained directly on the Italian corpus, still outperform multilingual ones trained on the English data and tested on Italian, although some of them exhibit promising prediction performances.

Detection of Privacy-Harming Social Media Posts in Italian

Peiretti, Federico^First;Pensa, Ruggero G.^Last

2023-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Titolo dell'evento
	
				9th International Symposium on Security and Privacy in Social Networks and Big Data (SocialSec 2023)
			
	Luogo dell'evento
	
				University of Kent, Canterbury, UK
			
	Data dell'evento
	
				August 14-16, 2023
			
	Titolo del volume
	
				SocialSec 2023: Security and Privacy in Social Networks and Big Data
			
	Nome editore
	
				Springer
			
	N. Volume
	
				14097
			
	Pagine (da)
	
				203
			
	Pagine (a)
	
				223
			
	Codice ISBN
	
				978-981-99-5176-5
978-981-99-5177-2
			
	DOI
	
				https://dx.doi.org/10.1007/978-981-99-5177-2_12
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				https://link.springer.com/chapter/10.1007/978-981-99-5177-2_12
			
	Parole Chiave
	
				Privacy, Neural language models, Social media
			
	Tutti gli autori
	
						Peiretti, Federico; Pensa, Ruggero G.
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

File	Dimensione	Formato
main.pdf Accesso aperto Descrizione: preprint Tipo di file: PREPRINT (PRIMA BOZZA) Dimensione 448.44 kB Formato Adobe PDF Visualizza/Apri	448.44 kB	Adobe PDF	Visualizza/Apri
socialsec2023_printed.pdf Accesso riservato Descrizione: PDF editoriale Tipo di file: PDF EDITORIALE Dimensione 212.68 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	212.68 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1925050

Nome	Dominio	Durata	Descrizione
s_.*	plu.mx	sessione	recupero grafico citazioni sociali da plumx
A_.*	core.ac.uk	7 giorni	recupero pubblicazioni consigliate per il pannello core-recommander
GS_.*	gstatic.com	richiesta http	visualizza grafico citazioni
CC_.*	creativecommons.org	richiesta http	visualizza licenza bitstream

Detection of Privacy-Harming Social Media Posts in Italian

Peiretti, Federico^First;Pensa, Ruggero G.^Last

First

Last

2023-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

CINECA IRIS Institutional Research Information System

Detection of Privacy-Harming Social Media Posts in Italian

Peiretti, FedericoFirst;Pensa, Ruggero G. Last

First

Last

2023-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Peiretti, Federico^First;Pensa, Ruggero G.^Last

Scheda breve

Scheda completa

Scheda completa (DC)