Inherent Trade-Offs in the Fair Determination of Risk Scores
Abstract
Recent discussion in the public sphere about algorithmic classification has involved tension between competing notions of what it means for a probabilistic classification to be fair to different groups. We formalize three fairness conditions that lie at the heart of these debates, and we prove that except in highly constrained special cases, there is no method that can satisfy these three conditions simultaneously. Moreover, even satisfying all three conditions approximately requires that the data lie in an approximate version of one of the constrained special cases identified by our theorem. These results suggest some of the ways in which key notions of fairness are incompatible with each other, and hence provide a framework for thinking about the trade-offs between them.
- Publication:
-
arXiv e-prints
- Pub Date:
- September 2016
- DOI:
- 10.48550/arXiv.1609.05807
- arXiv:
- arXiv:1609.05807
- Bibcode:
- 2016arXiv160905807K
- Keywords:
-
- Computer Science - Machine Learning;
- Computer Science - Computers and Society;
- Statistics - Machine Learning
- E-Print:
- To appear in Proceedings of Innovations in Theoretical Computer Science (ITCS), 2017