CINECA IRIS Institutional Research Information System

We introduce HateBERT, a re-trained BERT model for abusive language detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit comments in English from communities banned for being offensive, abusive, or hateful that we have curated and made available to the public. We present the results of a detailed comparison between a general pre-trained language model and the retrained version on three English datasets for offensive, abusive language and hate speech detection tasks. In all datasets, HateBERT outperforms the corresponding general BERT model. We also discuss a battery of experiments comparing the portability of the fine-tuned models across the datasets, suggesting that portability is affected by compatibility of the annotated phenomena.

HateBERT: Retraining BERT for Abusive Language Detection in English

Tommaso Caselli;Valerio Basile;Jelena Mitrovic;Michael Granitzer

2021-01-01

Abstract

We introduce HateBERT, a re-trained BERT model for abusive language detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit comments in English from communities banned for being offensive, abusive, or hateful that we have curated and made available to the public. We present the results of a detailed comparison between a general pre-trained language model and the retrained version on three English datasets for offensive, abusive language and hate speech detection tasks. In all datasets, HateBERT outperforms the corresponding general BERT model. We also discuss a battery of experiments comparing the portability of the fine-tuned models across the datasets, suggesting that portability is affected by compatibility of the annotated phenomena.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Titolo dell'evento
	
				5th Workshop on Online Abuse and Harms (WOAH 2021)
			
	Luogo dell'evento
	
				Bangkok, Thailand
			
	Data dell'evento
	
				6//2021
			
	Titolo del volume
	
				Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021)
			
	Nome editore
	
				Association for Computational Linguistics
			
	Pagine (da)
	
				17
			
	Pagine (a)
	
				25
			
	DOI
	
				https://dx.doi.org/10.18653/v1/2021.woah-1.3
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				https://aclanthology.org/2021.woah-1.3
			
	Tutti gli autori
	
						Tommaso Caselli;
Valerio Basile;
Jelena Mitrovic;
Michael Granitzer
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

File	Dimensione	Formato
2021.woah-1.3.pdf Accesso aperto Descrizione: Articolo principale Tipo di file: PDF EDITORIALE Dimensione 575.41 kB Formato Adobe PDF Visualizza/Apri	575.41 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1824371

Citazioni

ND

133

57

social impact