CINECA IRIS Institutional Research Information System

In this paper we describe a deep learning model based on a Convolutional Neural Network (CNN). The model was developed for the Profiling Hate Speech Spreaders (HSSs) task proposed by PAN 2021 organizers and hosted at the 2021 CLEF Conference. Our approach to the task of classifying an author as HSS or not (nHSS) takes advantage of a CNN based on a single convolutional layer. In this binary classification task, on the tests performed using a 5-fold cross validation, the proposed model reaches a maximum accuracy of 0.80 on the multilingual (i.e., English and Spanish) training set, and a minimum loss value of 0.51 on the same set. As announced by the task organizers, the trained model presented is able to reach an overall accuracy of 0.79 on the full test set. This overall accuracy is obtained averaging the accuracy achieved by the model on both languages. In particular, with regard to the Spanish test set, our model achieves an accuracy of 0.85, while on the English test set the same model achieved an accuracy of 0.73. Thanks to the model presented in this paper, our team won the 2021 PAN competition on profiling HSSs.

Detection of Hate Speech Spreaders using Convolutional Neural Networks

Siino Marco^First;Di Nuovo Elisa;Ilenia Tinnirello;Marco La Cascia

2021-01-01

Abstract

In this paper we describe a deep learning model based on a Convolutional Neural Network (CNN). The model was developed for the Profiling Hate Speech Spreaders (HSSs) task proposed by PAN 2021 organizers and hosted at the 2021 CLEF Conference. Our approach to the task of classifying an author as HSS or not (nHSS) takes advantage of a CNN based on a single convolutional layer. In this binary classification task, on the tests performed using a 5-fold cross validation, the proposed model reaches a maximum accuracy of 0.80 on the multilingual (i.e., English and Spanish) training set, and a minimum loss value of 0.51 on the same set. As announced by the task organizers, the trained model presented is able to reach an overall accuracy of 0.79 on the full test set. This overall accuracy is obtained averaging the accuracy achieved by the model on both languages. In particular, with regard to the Spanish test set, our model achieves an accuracy of 0.85, while on the English test set the same model achieved an accuracy of 0.73. Thanks to the model presented in this paper, our team won the 2021 PAN competition on profiling HSSs.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Lingua di pubblicazione
	
				Inglese
			
	Su invito
	
				contributo
			
	Tipo di evento
	
				1 - Conferenza
			
	Titolo dell'evento
	
				PAN 2021 Profiling Hate Speech Spreaders on Twitter @ CLEF
			
	Luogo dell'evento
	
				Bucharest (online)
			
	Data dell'evento
	
				21-24 settembre 2021
			
	Rilevanza dell'evento
	
				Internazionale
			
	Curatore del volume
	
				Guglielmo Faggioli, Nicola Ferro, Alexis Joly, Maria Maistro, Florina Piroi
			
	Titolo del volume
	
				CLEF 2021 Working Notes
			
	Referee
	
				Comitato scientifico
			
	Nome editore
	
				CEUR
			
	Città editore
	
				Aachen
			
	Nazione editore
	
				GERMANIA
			
	N. Volume
	
				2936
			
	Pagine (da)
	
				2126
			
	Pagine (a)
	
				2136
			
	Numero di Pagine
	
				11
			
	Titolo della serie (se presente ISSN)
	
				CEUR WORKSHOP PROCEEDINGS
			
	Codice Scopus
	
				2-s2.0-85113436218
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				http://ceur-ws.org/Vol-2936/paper-189.pdf
			
	Parole Chiave
	
				Hate Speech, Deep Learning, Author Profiling, Convolutional Neural Network, Word Embedding
			
	Coautori affiliati a enti stranieri
	
				no
			
	Prodotto conforme al Regolamento di Ateneo sull'accesso aperto?
	
				4 – prodotto già presente in altro archivio Open Access (arXiv, REPEC…)
			
	Numero autori
	
				4
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Tipologia
	
				04-CONTRIBUTO IN ATTI DI CONVEGNO::04A-Conference paper in volume
			
	Tutti gli autori
	
						Siino Marco, Di Nuovo Elisa, Ilenia Tinnirello, Marco La Cascia
					
	Tipologia sito docente
	
				273
			
	Fulltext
	
				none
			
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1804428

Citazioni

ND

34

ND

social impact