CINECA IRIS Institutional Research Information System

Paint it, BLACK: a Novel Methodology for Prompting

Federico Torrielli (first author)

2023-01-01

Abstract

Latent Diffusion Models have recently emerged as the state-of-the-art approach for synthetic image generation. On the Web, their adoption may significantly change how content is both generated and explored. For example, future Web platforms may create alternative and personalised images for individual users or improve accessibility for users with disabilities. However, because this research area is still nascent, there remains a knowledge gap in how to use these models effectively, which can clutter the digital space with poor-quality AI-generated content, diminishing both the overall perceived impact and the user experience. To address this issue, we propose a novel methodology aimed at generating high-quality prompts with minimal user effort. In particular, we present BLACK (Background, Lighting, Amenities, Context, and Kinesis), a prompt generation model designed specifically to produce high-quality images that satisfy a proposed set of five desiderata. Through concrete examples, we demonstrate the impact of the prompting model on generation quality. As a second contribution, we publicly release a structured resource of prompts along with their expected results.

Year: 2023
Event title: GENerative, Explainable and Reasonable Artificial Learning Workshop
Event location: Torino, Italy
Event date: 20 September 2023
Volume title: Proceedings of the Workshop on GENerative, Explainable and Reasonable Artificial Learning co-located with the 15th Biannual Conference of the Italian SIGCHI Chapter (CHITALY 2023)
Editor name: Federico Torrielli, Amon Rapp, Luigi Di Caro
Pages (from): 3
Pages (to): 11
Product URL (open-access archives, full text on publisher site, etc.): https://ceur-ws.org/Vol-3571/short1.pdf
Keywords: latent diffusion model, prompt engineering, image generation, generative AI, generative artificial intelligence
All authors: Federico Torrielli
Appears in types: 04A-Conference paper in volume

Files in this product:

File	Size	Format
short1.pdf (open access; file type: publisher's PDF)	11.26 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/1947172
