CINECA IRIS Institutional Research Information System

In this paper we describe a deep learning model based on a Convolutional Neural Network (CNN). The model was developed for the Profiling Hate Speech Spreaders (HSSs) task proposed by PAN 2021 organizers and hosted at the 2021 CLEF Conference. Our approach to the task of classifying an author as HSS or not (nHSS) takes advantage of a CNN based on a single convolutional layer. In this binary classification task, on the tests performed using a 5-fold cross validation, the proposed model reaches a maximum accuracy of 0.80 on the multilingual (i.e., English and Spanish) training set, and a minimum loss value of 0.51 on the same set. As announced by the task organizers, the trained model presented is able to reach an overall accuracy of 0.79 on the full test set. This overall accuracy is obtained averaging the accuracy achieved by the model on both languages. In particular, with regard to the Spanish test set, our model achieves an accuracy of 0.85, while on the English test set the same model achieved an accuracy of 0.73. Thanks to the model presented in this paper, our team won the 2021 PAN competition on profiling HSSs.

Detection of Hate Speech Spreaders using Convolutional Neural Networks

Siino Marco^First;Di Nuovo Elisa;Ilenia Tinnirello;Marco La Cascia

2021-01-01

Abstract

In this paper we describe a deep learning model based on a Convolutional Neural Network (CNN). The model was developed for the Profiling Hate Speech Spreaders (HSSs) task proposed by PAN 2021 organizers and hosted at the 2021 CLEF Conference. Our approach to the task of classifying an author as HSS or not (nHSS) takes advantage of a CNN based on a single convolutional layer. In this binary classification task, on the tests performed using a 5-fold cross validation, the proposed model reaches a maximum accuracy of 0.80 on the multilingual (i.e., English and Spanish) training set, and a minimum loss value of 0.51 on the same set. As announced by the task organizers, the trained model presented is able to reach an overall accuracy of 0.79 on the full test set. This overall accuracy is obtained averaging the accuracy achieved by the model on both languages. In particular, with regard to the Spanish test set, our model achieves an accuracy of 0.85, while on the English test set the same model achieved an accuracy of 0.73. Thanks to the model presented in this paper, our team won the 2021 PAN competition on profiling HSSs.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Titolo dell'evento
	
				PAN 2021 Profiling Hate Speech Spreaders on Twitter @ CLEF
			
	Luogo dell'evento
	
				Bucharest (online)
			
	Data dell'evento
	
				21-24 settembre 2021
			
	Titolo del volume
	
				CLEF 2021 Working Notes
			
	Nome editore
	
				CEUR
			
	N. Volume
	
				2936
			
	Pagine (da)
	
				2126
			
	Pagine (a)
	
				2136
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				http://ceur-ws.org/Vol-2936/paper-189.pdf
			
	Parole Chiave
	
				Hate Speech, Deep Learning, Author Profiling, Convolutional Neural Network, Word Embedding
			
	Tutti gli autori
	
						Siino Marco, Di Nuovo Elisa, Ilenia Tinnirello, Marco La Cascia
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1804428

Citazioni

ND

50

ND

social impact