How Do People Develop Folk Theories of Generative AI Text-to-Image Models? A Qualitative Study on How People Strive to Explain and Make Sense of GenAI
Di Lodovico, Chiara; Torrielli, Federico; Di Caro, Luigi; Rapp, Amon
2025-01-01
Abstract
Generative Artificial Intelligence (GenAI) text-to-image models have made significant progress in emulating human-like outputs. However, understanding the inner functioning of these models remains a challenge due to their complexity and black-box nature. It has been observed that individuals naturally develop informal conceptualizations, termed “folk theories,” to explain the behaviors of algorithmic systems. The specific nature of GenAI text-to-image models, which are obscure in their working principles, yet carry out activities that are peculiar to humans, makes it interesting to investigate people’s theorization about this technology. With this aim, we conducted a qualitative interview study with 20 participants and observed how they accounted for the outputs of Stable Diffusion. The study findings show that participants developed a wide spectrum of conceptualizations, including folk theories that appear distinctive of GenAI text-to-image technology, also ascribing to the model a variety of “mental states.” Furthermore, we found that theory building follows different inductive and deductive trajectories, with participants employing diverse strategies to explain the functioning of the technology.

| File | Access | File type | Size | Format |
|---|---|---|---|---|
| 2025b-IJHCI.pdf | Restricted access | Publisher's PDF | 1.97 MB | Adobe PDF |
| IJHCI-2025camreadyfinal.pdf | Open access | Postprint (author's final version) | 2.33 MB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.



