Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
Abstract
Advances in generative models have led to significant interest in image synthesis, demonstrating the ability to generate high-quality images for a diverse range of text prompts. Despite this progress, most studies ignore the presence of bias. In this paper, we examine several text-to-image models not only by qualitatively assessing their performance in generating accurate images of human faces, groups, and specified numbers of objects but also by presenting a social bias analysis. As expected, models with larger capacity generate higher-quality images. However, we also document the inherent gender or social biases these models possess, offering a more complete understanding of their impact and limitations.
- Publication:
-
arXiv e-prints
- Pub Date:
- June 2024
- DOI:
- 10.48550/arXiv.2407.00138
- arXiv:
- arXiv:2407.00138
- Bibcode:
- 2024arXiv240700138M
- Keywords:
-
- Computer Science - Artificial Intelligence;
- Computer Science - Computer Vision and Pattern Recognition;
- I.2.6;
- I.2.10;
- I.2.7;
- I.4.10
- E-Print:
- 20 pages, 8 figures