It’s the end of the gold standard as we know it. On the impact of pre-aggregation on the evaluation of highly subjective tasks
Basile V.
2020-01-01
Abstract
Supervised machine learning, in particular in Natural Language Processing, is based on the creation of high-quality gold standard datasets for training and benchmarking. The de facto standard annotation methodologies work well for traditionally relevant tasks in Computational Linguistics. However, critical issues are surfacing when applying old techniques to the study of highly subjective phenomena such as irony and sarcasm, or abusive and offensive language. This paper calls for a paradigm shift, away from monolithic, majority-aggregated gold standards, and towards an inclusive framework that preserves the personal opinions and culturally driven perspectives of the annotators. New training sets and supervised machine learning techniques will have to be adapted in order to create fair, inclusive, and ultimately more informed models of subjective semantic and pragmatic phenomena. The arguments are backed by a synthetic experiment showing the lack of correlation between the difficulty of an annotation task, its degree of subjectivity, and the quality of the predictions of a supervised classifier trained on the resulting data.
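The abstract refers to a synthetic experiment relating annotation difficulty, degree of subjectivity, and classifier quality. The paper's actual experimental setup is not reproduced here; the sketch below is only a minimal illustration of that kind of simulation. The simulate_dataset function, the per-annotator threshold model of subjectivity, the unanimity-based agreement measure, the logistic-regression classifier, and all numeric parameters are assumptions made for demonstration, not the author's method.

```python
# Illustrative sketch only: simulate annotators with individual decision
# thresholds, build a majority-aggregated "gold standard", train a classifier,
# and check how its quality relates to inter-annotator agreement.
import numpy as np
from scipy.stats import pearsonr
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

def simulate_dataset(n_items=2000, n_annotators=5, subjectivity=0.2):
    """Simulate a binary labelling task.

    `subjectivity` controls how far each annotator's personal threshold
    deviates from a shared one; higher values produce more disagreement.
    """
    X = rng.normal(size=(n_items, 10))            # item features
    w = rng.normal(size=10)                       # latent decision boundary
    scores = X @ w                                # latent item "intensity"
    # Each annotator applies their own, shifted threshold.
    thresholds = rng.normal(0.0, subjectivity * scores.std(), size=n_annotators)
    labels = np.stack([(scores > t).astype(int) for t in thresholds], axis=1)
    # Majority-aggregated gold standard (ties broken toward the positive class).
    gold = (labels.mean(axis=1) >= 0.5).astype(int)
    # Crude agreement measure: fraction of items labelled unanimously.
    agreement = (labels.std(axis=1) == 0).mean()
    return X, gold, agreement

results = []
for subj in [0.0, 0.2, 0.5, 1.0, 2.0]:
    X, y, agreement = simulate_dataset(subjectivity=subj)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
    clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    f1 = f1_score(y_te, clf.predict(X_te))
    results.append((agreement, f1))
    print(f"subjectivity={subj:.1f}  unanimity={agreement:.2f}  F1={f1:.3f}")

agreements, f1s = zip(*results)
r, p = pearsonr(agreements, f1s)
print(f"Pearson r between agreement and classifier F1: {r:.2f} (p={p:.2f})")
```

Sweeping the subjectivity parameter and correlating per-run agreement with test F1 loosely mirrors the kind of comparison the abstract describes; it is not intended to replicate the paper's findings.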