Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search
Abstract
Internet search affects people's cognition of the world, so mitigating biases in search results and learning fair models is imperative for social good. In this work we study a distinctive gender bias in image search: the retrieved images are often gender-imbalanced for gender-neutral natural language queries. We diagnose two typical image search models: the specialized model trained on in-domain datasets and the generalized representation model pre-trained on massive image and text data across the internet. Both models suffer from severe gender bias. We therefore introduce two novel debiasing approaches: an in-processing fair sampling method that addresses the gender imbalance issue when training models, and a post-processing feature clipping method based on mutual information that debiases the multimodal representations of pre-trained models. Extensive experiments on the MS-COCO and Flickr30K benchmarks show that our methods significantly reduce the gender bias in image search models.
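As a rough illustration of the post-processing idea mentioned in the abstract, the sketch below ranks embedding dimensions by their estimated mutual information with a gender label and drops the top-ranked ones. It is a minimal sketch under stated assumptions, not the paper's implementation: the histogram-based estimator, the toy embeddings, and the names `mutual_information`, `clip_features`, `project`, and `num_clip` are all hypothetical and chosen only for this example.

```python
import math
from collections import Counter


def mutual_information(xs, ys, bins=4):
    """Plug-in estimate of I(X; Y) in nats.

    X is a continuous feature discretized into equal-width bins,
    Y is a discrete attribute label (assumed available per sample).
    """
    lo, hi = min(xs), max(xs)
    width = (hi - lo) / bins or 1.0  # constant features fall into one bin
    binned = [min(int((x - lo) / width), bins - 1) for x in xs]
    n = len(xs)
    joint = Counter(zip(binned, ys))
    px, py = Counter(binned), Counter(ys)
    return sum(
        (c / n) * math.log(c * n / (px[b] * py[y]))
        for (b, y), c in joint.items()
    )


def clip_features(features, labels, num_clip):
    """Return the kept dimension indices and the per-dimension MI scores.

    The num_clip dimensions with the highest estimated MI are dropped.
    """
    dims = range(len(features[0]))
    scores = [
        mutual_information([row[d] for row in features], labels) for d in dims
    ]
    ranked = sorted(dims, key=lambda d: scores[d], reverse=True)
    clipped = set(ranked[:num_clip])
    keep = [d for d in dims if d not in clipped]
    return keep, scores


def project(vector, keep):
    """Keep only the retained dimensions of an embedding."""
    return [vector[d] for d in keep]


# Toy embeddings (hypothetical): dimension 0 tracks the label,
# dimension 1 is constant, dimension 2 is unrelated to the label.
features = [
    [0.9, 0.5, 0.1],
    [0.8, 0.5, 0.9],
    [0.1, 0.5, 0.2],
    [0.2, 0.5, 0.8],
]
labels = ["f", "f", "m", "m"]

keep, scores = clip_features(features, labels, num_clip=1)
clipped_embedding = project(features[0], keep)
```

In a retrieval setting, the same `keep` indices would presumably be applied to both the image and the text embeddings before computing similarity, so that the two modalities stay in a shared space. How many dimensions to clip is a trade-off between bias reduction and retrieval quality that this sketch does not address.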
- Publication: arXiv e-prints
- Pub Date: September 2021
- arXiv: arXiv:2109.05433
- Bibcode: 2021arXiv210905433W
- Keywords: Computer Science - Computer Vision and Pattern Recognition; Computer Science - Computation and Language; I.2.7
- E-Print: 14 pages, EMNLP 2021