Latent Diffusion Models have recently emerged as the state-of-the-art approach for synthetic image generation. In the Web context, their adoption may significantly impact the way it is currently approached, from both sides of content generation and exploration. For example, future Web platforms may create alternative and personalised images for individual users or improve the accessibility for users with disabilities. However, due to the nascent stage of this research area, there remains a knowledge gap in effectively utilising these models, which can clutter the digital space with poor-quality AI-generated, thus diminishing the overall perceived impact and the user experience. To address this issue, we propose a novel methodology aimed at generating high-quality prompts with minimal user effort. In particular, we present BLACK (Background, Lighting, Amenities, Context, and Kinesis), a prompt generation model directly designed for achieving high-quality images satisfying a proposed set of five desiderata. Through concrete examples, we demonstrate the impact of the prompting model in improving the generation quality. As a second contribution, we publicly release a structured resource of prompts along with expected results.

Paint it, BLACK: a Novel Methodology for Prompting

Federico Torrielli
First
2023-01-01

Abstract

Latent Diffusion Models have recently emerged as the state-of-the-art approach for synthetic image generation. In the Web context, their adoption may significantly impact the way it is currently approached, from both sides of content generation and exploration. For example, future Web platforms may create alternative and personalised images for individual users or improve the accessibility for users with disabilities. However, due to the nascent stage of this research area, there remains a knowledge gap in effectively utilising these models, which can clutter the digital space with poor-quality AI-generated, thus diminishing the overall perceived impact and the user experience. To address this issue, we propose a novel methodology aimed at generating high-quality prompts with minimal user effort. In particular, we present BLACK (Background, Lighting, Amenities, Context, and Kinesis), a prompt generation model directly designed for achieving high-quality images satisfying a proposed set of five desiderata. Through concrete examples, we demonstrate the impact of the prompting model in improving the generation quality. As a second contribution, we publicly release a structured resource of prompts along with expected results.
2023
GENerative, Explainable and Reasonable Artificial Learning Workshop
Torino, Italy
20 September 2023
Proceedings of the Workshop on GENerative, Explainable and Reasonable Artificial Learning co-located with the 15th Biannual Conference of the Italian SIGCHI Chapter (CHITALY 2023)
Federico Torrielli, Amon Rapp, Luigi Di Caro
3
11
https://ceur-ws.org/Vol-3571/short1.pdf
latent diffusion model, prompt engineering, image generation, generative ai, generative artificial intelligence
Federico Torrielli
File in questo prodotto:
File Dimensione Formato  
short1.pdf

Accesso aperto

Tipo di file: PDF EDITORIALE
Dimensione 11.26 MB
Formato Adobe PDF
11.26 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1947172
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact