The presence and integration of immigrants is one of the most controversial issues in our society, and given current worldwide political instabilities, it will likely become ever more prominent in the cultural and political debate. Social media play an increasingly important role in how citizens debate opinions and react to local and global events. However, several studies point out the danger of social media as a breeding ground for online hate speech (or cyberhate). We propose a novel approach to the exploratory analysis of social phenomena based on the integration of automatic detection of cyberhate against immigrants with offline indicators. We gathered data from the Italian Twittersphere and from the main supplier of official statistical data in Italy (ISTAT). We developed a supervised classification model for hate speech detection, trained on a corpus of Italian tweets manually annotated for hate speech against immigrants, and use it to automatically annotate a large sample of geo-tagged tweets over a span of six years. We crossed this data with the ISTAT data, exploring three macro-indicators related to employment, education and crime. We found correlations suggesting an interplay between economical and cultural factors and the expression of hate online

Leveraging Hate Speech Detection to Investigate Immigration-related Phenomena in Italy

Florio, Komal;Basile, Valerio;Lai, Mirko;Patti, Viviana
2019-01-01

Abstract

The presence and integration of immigrants is one of the most controversial issues in our society, and given current worldwide political instabilities, it will likely become ever more prominent in the cultural and political debate. Social media play an increasingly important role in how citizens debate opinions and react to local and global events. However, several studies point out the danger of social media as a breeding ground for online hate speech (or cyberhate). We propose a novel approach to the exploratory analysis of social phenomena based on the integration of automatic detection of cyberhate against immigrants with offline indicators. We gathered data from the Italian Twittersphere and from the main supplier of official statistical data in Italy (ISTAT). We developed a supervised classification model for hate speech detection, trained on a corpus of Italian tweets manually annotated for hate speech against immigrants, and use it to automatically annotate a large sample of geo-tagged tweets over a span of six years. We crossed this data with the ISTAT data, exploring three macro-indicators related to employment, education and crime. We found correlations suggesting an interplay between economical and cultural factors and the expression of hate online
Emotion and Sentiment in Social and Expressive Media: Media and Arts for Inclusive, Fair and Reflective Societies, Co-located with ACII 2019, the 8th International Conference on Affective Computing and Intelligent Interaction (ESSEM@ACII 2019)
Cambridge, United Kingdom
September 3, 2019
2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)
IEEE
1
7
978-1-7281-3891-6
https://ieeexplore.ieee.org/document/8925079
NLP Applications and Tools, Hate Speech Detection, Social Sciences, Hate Speech against Immigrants, Twitter
Florio, Komal; Basile, Valerio; Lai, Mirko; Patti, Viviana
File in questo prodotto:
File Dimensione Formato  
08925079.pdf

Accesso riservato

Tipo di file: PDF EDITORIALE
Dimensione 6.81 MB
Formato Adobe PDF
6.81 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1721734
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? ND
social impact