Latent Diffusion Models have recently emerged as the state-of-the-art approach for synthetic image generation. In the Web context, their adoption may significantly impact the way it is currently approached, from both sides of content generation and exploration. For example, future Web platforms may create alternative and personalised images for individual users or improve the accessibility for users with disabilities. However, due to the nascent stage of this research area, there remains a knowledge gap in effectively utilising these models, which can clutter the digital space with poor-quality AI-generated, thus diminishing the overall perceived impact and the user experience. To address this issue, we propose a novel methodology aimed at generating high-quality prompts with minimal user effort. In particular, we present BLACK (Background, Lighting, Amenities, Context, and Kinesis), a prompt generation model directly designed for achieving high-quality images satisfying a proposed set of five desiderata. Through concrete examples, we demonstrate the impact of the prompting model in improving the generation quality. As a second contribution, we publicly release a structured resource of prompts along with expected results.
Paint it, BLACK: a Novel Methodology for Prompting
Federico Torrielli
First
2023-01-01
Abstract
Latent Diffusion Models have recently emerged as the state-of-the-art approach for synthetic image generation. In the Web context, their adoption may significantly impact the way it is currently approached, from both sides of content generation and exploration. For example, future Web platforms may create alternative and personalised images for individual users or improve the accessibility for users with disabilities. However, due to the nascent stage of this research area, there remains a knowledge gap in effectively utilising these models, which can clutter the digital space with poor-quality AI-generated, thus diminishing the overall perceived impact and the user experience. To address this issue, we propose a novel methodology aimed at generating high-quality prompts with minimal user effort. In particular, we present BLACK (Background, Lighting, Amenities, Context, and Kinesis), a prompt generation model directly designed for achieving high-quality images satisfying a proposed set of five desiderata. Through concrete examples, we demonstrate the impact of the prompting model in improving the generation quality. As a second contribution, we publicly release a structured resource of prompts along with expected results.File | Dimensione | Formato | |
---|---|---|---|
short1.pdf
Accesso aperto
Tipo di file:
PDF EDITORIALE
Dimensione
11.26 MB
Formato
Adobe PDF
|
11.26 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.