The Creativity of Text-to-Image Generation

doi:10.48550/arXiv.2206.02904

The Creativity of Text-to-Image Generation

Oppenlaender, Jonas

Text-guided synthesis of images has made a giant leap towards becoming a mainstream phenomenon. With text-to-image generation systems, anybody can create digital images and artworks. This provokes the question of whether text-to-image generation is creative. This paper expounds on the nature of human creativity involved in text-to-image art (so-called "AI art") with a specific focus on the practice of prompt engineering. The paper argues that the current product-centered view of creativity falls short in the context of text-to-image generation. A case exemplifying this shortcoming is provided and the importance of online communities for the creative ecosystem of text-to-image art is highlighted. The paper provides a high-level summary of this online ecosystem drawing on Rhodes' conceptual four P model of creativity. Challenges for evaluating the creativity of text-to-image generation and opportunities for research on text-to-image generation in the field of Human-Computer Interaction (HCI) are discussed.

Publication:

arXiv e-prints

Pub Date:

May 2022

DOI:

10.48550/arXiv.2206.02904

arXiv:

arXiv:2206.02904

Bibcode:

2022arXiv220602904O

Keywords:

Computer Science - Human-Computer Interaction;
Computer Science - Graphics;
H.5;
H.m

E-Print:

doi:10.1145/3569219.3569352

NASA/ADS

The Creativity of Text-to-Image Generation

Abstract