Investigating Labeler Bias in Face Annotation for Machine Learning
Abstract
In a world increasingly reliant on artificial intelligence, it is more important than ever to consider the ethical implications of artificial intelligence on humanity. One key under-explored challenge is labeler bias, which can create inherently biased datasets for training and subsequently lead to inaccurate or unfair decisions in healthcare, employment, education, and law enforcement. Hence, we conducted a study to investigate and measure the existence of labeler bias using images of people from different ethnicities and sexes in a labeling task. Our results show that participants possess stereotypes that influence their decision-making process and that labeler demographics impact assigned labels. We also discuss how labeler bias influences datasets and, subsequently, the models trained on them. Overall, a high degree of transparency must be maintained throughout the entire artificial intelligence training process to identify and correct biases in the data as early as possible.
- Publication:
-
arXiv e-prints
- Pub Date:
- January 2023
- DOI:
- 10.48550/arXiv.2301.09902
- arXiv:
- arXiv:2301.09902
- Bibcode:
- 2023arXiv230109902H
- Keywords:
-
- Computer Science - Machine Learning;
- Computer Science - Human-Computer Interaction
- E-Print:
- Frontiers in Artificial Intelligence and Applications (2024) 145-162