CINECA IRIS Institutional Research Information System

PERSEID - Perspectivist Irony Detection: A CALAMITA Challenge

Basile V.; Casola S.; Frenda S.; Lo S. M.

2024-01-01

Abstract

Work on perspectivism and human label variation has emphasized the need to collect and leverage diverse voices and points of view throughout the Natural Language Processing pipeline. PERSEID places itself in this line of work. We consider the task of irony detection in short social media conversations in Italian collected from Twitter (X) and Reddit. To do so, we leverage data from MultiPICO, a recent multilingual dataset with disaggregated annotations and annotators' metadata, containing 1000 (Post, Reply) pairs with five annotations each on average. We aim to evaluate whether prompting LLMs with additional annotators' demographic information (namely gender only, age only, and the combination of the two) results in improved performance compared to a baseline in which only the input text is provided. The evaluation is zero-shot, and results are scored against the disaggregated annotations using F1.
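
To make the setup in the abstract concrete, the following is a minimal sketch of the four prompting conditions (text only, gender only, age only, gender and age) and of scoring predictions against disaggregated annotations, where every (item, annotator) pair counts as one instance. The prompt wording, the function names `build_prompt` and `f1_binary`, and the choice of positive-class binary F1 are assumptions made for illustration; the abstract does not specify the actual prompts or the F1 variant used in the challenge.

```python
from typing import List, Optional


def build_prompt(post: str, reply: str,
                 gender: Optional[str] = None,
                 age: Optional[int] = None) -> str:
    """Build a zero-shot prompt (hypothetical wording).

    Demographic traits are added only when provided, which yields the
    four conditions: baseline, gender only, age only, gender + age.
    """
    traits = []
    if gender is not None:
        traits.append(f"gender: {gender}")
    if age is not None:
        traits.append(f"age: {age}")
    persona = ""
    if traits:
        persona = f"Answer as an annotator with {', '.join(traits)}.\n"
    return (
        f"{persona}Post: {post}\nReply: {reply}\n"
        "Is the reply ironic? Answer 1 (ironic) or 0 (not ironic)."
    )


def f1_binary(gold: List[int], pred: List[int], positive: int = 1) -> float:
    """F1 for the positive class over disaggregated labels.

    Each position is one (item, annotator) pair, so an item with five
    annotations contributes five entries. Assumed variant, not confirmed
    by the source.
    """
    tp = sum(g == positive and p == positive for g, p in zip(gold, pred))
    fp = sum(g != positive and p == positive for g, p in zip(gold, pred))
    fn = sum(g == positive and p != positive for g, p in zip(gold, pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

Under this sketch, the baseline condition is `build_prompt(post, reply)`, and the demographic conditions pass `gender`, `age`, or both, taken from the metadata of the annotator whose label is being predicted.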

Year: 2024
Event title: 10th Italian Conference on Computational Linguistics, CLiC-it 2024
Event location: ita
Event date: 2024
Volume title: Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), Pisa, Italy, December 4-6, 2024
Publisher: CEUR-WS
Volume no.: 3878
Pages (from): 1074
Pages (to): 1081
Product URL (open-access archives, full text on the publisher's site, etc.): https://aclanthology.org/2024.clicit-1.118.pdf
Keywords: Evaluation; Irony Detection; Perspectivism
All authors: Basile V.; Casola S.; Frenda S.; Lo S.M.
Appears in types: 04A-Conference paper in volume

Files in this product:

File	Size	Format
2024.clicit-1.118.pdf (open access; file type: publisher's PDF)	359.04 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/2084140
