Inherent Trade-Offs in the Fair Determination of Risk Scores

doi:10.48550/arXiv.1609.05807

Inherent Trade-Offs in the Fair Determination of Risk Scores

Recent discussion in the public sphere about algorithmic classification has involved tension between competing notions of what it means for a probabilistic classification to be fair to different groups. We formalize three fairness conditions that lie at the heart of these debates, and we prove that except in highly constrained special cases, there is no method that can satisfy these three conditions simultaneously. Moreover, even satisfying all three conditions approximately requires that the data lie in an approximate version of one of the constrained special cases identified by our theorem. These results suggest some of the ways in which key notions of fairness are incompatible with each other, and hence provide a framework for thinking about the trade-offs between them.

Publication:

arXiv e-prints

Pub Date:

September 2016

DOI:

10.48550/arXiv.1609.05807

arXiv:

arXiv:1609.05807

Bibcode:

2016arXiv160905807K

Keywords:

Computer Science - Machine Learning;
Computer Science - Computers and Society;
Statistics - Machine Learning

E-Print:

To appear in Proceedings of Innovations in Theoretical Computer Science (ITCS), 2017

NASA/ADS

Inherent Trade-Offs in the Fair Determination of Risk Scores

Abstract