CINECA IRIS Institutional Research Information System

Achieving factual accuracy is a known pending issue for language models. Their design centered around the interactive component of user interaction and the extensive use of “spontaneous” training data, has made them highly adept at conversational tasks but not fully reliable in terms of factual correctness. VeryfIT addresses this issue by evaluating the in-memory factual knowledge of language models on data written by professional fact-checkers, posing it as a true or false question. Topics of the statements vary but most are in specific domains related to the Italian government, policies, and social issues. The task presents several challenges: extracting statements from segments of speeches, determining appropriate contextual relevance both temporally and factually, and ultimately verifying the accuracy of the statements.

VeryfIT - Benchmark of Fact-Checked Claims for Italian: A CALAMITA Challenge

Gili J.;Patti V.;Passaro L.;Caselli T.

2024-01-01

Abstract

Achieving factual accuracy is a known pending issue for language models. Their design centered around the interactive component of user interaction and the extensive use of “spontaneous” training data, has made them highly adept at conversational tasks but not fully reliable in terms of factual correctness. VeryfIT addresses this issue by evaluating the in-memory factual knowledge of language models on data written by professional fact-checkers, posing it as a true or false question. Topics of the statements vary but most are in specific domains related to the Italian government, policies, and social issues. The task presents several challenges: extracting statements from segments of speeches, determining appropriate contextual relevance both temporally and factually, and ultimately verifying the accuracy of the statements.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Titolo dell'evento
	
				10th Italian Conference on Computational Linguistics, CLiC-it 2024
			
	Luogo dell'evento
	
				Pisa, Italia
			
	Data dell'evento
	
				2024
			
	Titolo del volume
	
				Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), Pisa, Italy, December 4-6, 2024
			
	Nome editore
	
				CEUR-WS
			
	N. Volume
	
				3878
			
	Pagine (da)
	
				1
			
	Pagine (a)
	
				9
			
	URL del prodotto (archivi open access, fulltext su sito editore, etc.)
	
				https://ceur-ws.org/Vol-3878/123_calamita_long.pdf
			
	Parole Chiave
	
				benchmark; CALAMITA; CheckIT!; fact checking; factual knowledge; fake news; Italian
			
	Tutti gli autori
	
						Gili J.; Patti V.; Passaro L.; Caselli T.
					
	Appare nelle tipologie:
	
				04A-Conference paper in volume

File in questo prodotto:

File	Dimensione	Formato
123_calamita_long.pdf Accesso aperto Tipo di file: PDF EDITORIALE Dimensione 1.25 MB Formato Adobe PDF Visualizza/Apri	1.25 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/2059279

Citazioni

ND

0

ND

social impact