Quantifying the Influence of Irrelevant Contexts on Political Opinions Produced by LLMs

D'Avenia, Samuele; Basile, Valerio
2025-01-01

Abstract

Several recent works have examined the generations produced by large language models (LLMs) on subjective topics such as political opinions and attitudinal questionnaires. There is growing interest in controlling these outputs to align with specific users or perspectives using model steering techniques. However, several studies have highlighted unintended and unexpected steering effects, where minor changes in the prompt or irrelevant contextual cues influence model-generated opinions. This work empirically tests how irrelevant information can systematically bias model opinions in specific directions. Using the Political Compass Test questionnaire, we conduct a detailed statistical analysis to quantify these shifts in the opinions generated by LLMs in an open-generation setting. The results demonstrate that even seemingly unrelated contexts consistently alter model responses in predictable ways, further highlighting challenges in ensuring the robustness and reliability of LLMs when generating opinions on subjective topics.
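
The abstract describes the experimental setup only at a high level. As a rough illustration of how such a protocol could be implemented (a minimal sketch, not the authors' actual pipeline: the generate callable, the prompt template, the keyword-based scoring, and the choice of a Mann-Whitney U test are all assumptions), the Python code below prepends an irrelevant context to each Political Compass Test item, maps open-ended generations onto an agreement scale, and tests whether the context shifts the score distribution relative to a no-context baseline.

# Hypothetical sketch of the kind of experiment the abstract describes.
# All function and variable names are illustrative, not taken from the paper.

from scipy.stats import mannwhitneyu

# Agreement scale used by Political Compass Test items (illustrative mapping).
SCALE = {"strongly disagree": -2, "disagree": -1, "agree": 1, "strongly agree": 2}

def score_response(text: str) -> int | None:
    """Map an open-ended generation to the agreement scale (naive keyword match)."""
    text = text.lower()
    # Check longer labels first so "strongly disagree" is not matched as "disagree".
    for label in sorted(SCALE, key=len, reverse=True):
        if label in text:
            return SCALE[label]
    return None  # response could not be mapped

def run_condition(generate, items, context=""):
    """Query the model on every item, optionally prefixed with an irrelevant context."""
    scores = []
    for item in items:
        prompt = f"{context}\n\n{item}\nState whether you agree or disagree and why."
        score = score_response(generate(prompt))
        if score is not None:
            scores.append(score)
    return scores

def compare_conditions(generate, items, irrelevant_context):
    """Test whether the irrelevant context systematically shifts the scores."""
    baseline = run_condition(generate, items)
    shifted = run_condition(generate, items, context=irrelevant_context)
    stat, p = mannwhitneyu(baseline, shifted, alternative="two-sided")
    return {"baseline_mean": sum(baseline) / len(baseline),
            "shifted_mean": sum(shifted) / len(shifted),
            "p_value": p}

Under these assumptions, compare_conditions(generate, pct_items, "The weather in Vienna was mild last week.") would return the mean agreement score per condition and a p-value for the shift, which is one simple way to quantify the kind of systematic bias the paper reports.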
Year: 2025
Conference: 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
Location: Vienna
Proceedings: Proceedings of the Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)
Publisher: Association for Computational Linguistics (ACL)
Volume: 4
Pages: 434-454
Files in this record:
2025.acl-srw.28.pdf (Open access; Editorial PDF, Adobe PDF format; 4.71 MB)

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/2117247
Citations
  • PubMed Central: not available
  • Scopus: 0
  • Web of Science: not available