Change My Mind: How Syntax-based Hate Speech Recognizers Can Uncover Hidden Motivations Based on Different Viewpoints

Mastromattei Michele; Basile Valerio; Zanzotto Fabio Massimo
2022-01-01

Abstract

Hate speech recognizers may mislabel sentences by not considering the different opinions that society holds on selected topics. In this paper, we show how explainable machine learning models based on syntax can help to understand the motivations that make a sentence offensive to a certain demographic group. To explore this hypothesis, we use several syntax-based neural networks, equipped with syntax heat analysis trees as post-hoc explanations of their classifications, and a dataset annotated by two groups with dissimilar cultural backgrounds. By contrasting particular trees, we compare the results and highlight the differences. The results show that the keywords that make a sentence offensive depend on the cultural background of the annotators and vary across different topics. In addition, the syntactic activations show that sub-trees are also highly relevant in the classification phase.
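As a rough, hypothetical illustration of the setup the abstract describes (not the paper's actual models, data, or explanation method), the sketch below contrasts occlusion-style sub-tree attributions produced by two toy, group-specific scorers that stand in for classifiers trained on labels from two annotator groups. It assumes nltk for constituency trees; the parse, lexicon weights, and sentence are invented for illustration.

```python
# Minimal, hypothetical sketch: contrast which constituent (sub-tree) drives the
# "offensive" score for two toy scorers standing in for classifiers trained on
# labels from two annotator groups. This is NOT the paper's pipeline; the parse
# tree, lexicons, and scores are invented for illustration only.
from nltk import Tree


def constituent_attributions(tree: Tree, score_fn) -> dict:
    """Occlusion-style attribution: remove each constituent and measure how
    much the offensiveness score drops when its tokens are taken out."""
    leaf_positions = tree.treepositions("leaves")
    full_score = score_fn(tree.leaves())
    attributions = {}
    for pos in tree.treepositions():
        node = tree[pos]
        if isinstance(node, Tree) and pos != ():          # skip leaves and root
            kept = [tree[p] for p in leaf_positions if p[: len(pos)] != pos]
            attributions[" ".join(node.leaves())] = full_score - score_fn(kept)
    return attributions


# Toy stand-ins for the two group-specific classifiers (hypothetical weights).
LEXICON_A = {"clueless": 0.9}                      # group A reacts to the insult word
LEXICON_B = {"clueless": 0.4, "politicians": 0.8}  # group B also weighs the target


def score_group_a(tokens):
    return sum(LEXICON_A.get(t.lower(), 0.0) for t in tokens)


def score_group_b(tokens):
    return sum(LEXICON_B.get(t.lower(), 0.0) for t in tokens)


sentence = Tree.fromstring(
    "(S (NP (DT those) (NNS politicians)) (VP (VBP are) (ADJP (JJ clueless))))"
)

for name, score_fn in [("group A", score_group_a), ("group B", score_group_b)]:
    attributions = constituent_attributions(sentence, score_fn)
    top_span = max(attributions, key=attributions.get)
    print(f"{name}: most influential constituent -> '{top_span}' "
          f"({attributions[top_span]:.2f})")
```

Under these invented weights, the two "annotator groups" disagree on which constituent makes the sentence offensive (the insult versus the targeted group), mirroring the kind of contrast between syntax heat analysis trees discussed in the paper.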
Year: 2022
Event: 1st Workshop on Perspectivist Approaches to Disagreement in NLP, NLPerspectives 2022
Country: France
Published in: 1st Workshop on Perspectivist Approaches to Disagreement in NLP, NLPerspectives 2022 as part of Language Resources and Evaluation Conference, LREC 2022 Workshop
Publisher: European Language Resources Association (ELRA)
Pages: 117-125
ISBN: 979-10-95546-98-6
URL: https://aclanthology.org/2022.nlperspectives-1.15.pdf
Keywords: Explainable models; Hate speech recognizer; Perspectivism
Authors: Mastromattei Michele; Basile Valerio; Zanzotto Fabio Massimo
Files in this item: 2022.nlperspectives-1.15.pdf (main article, publisher's PDF, open access, 567.44 kB)
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/1887747
Citations
  • PMC: not available
  • Scopus: 6
  • Web of Science: not available