Post-processing for Individual Fairness

doi:10.48550/arXiv.2110.13796

Post-processing for Individual Fairness

Post-processing in algorithmic fairness is a versatile approach for correcting bias in ML systems that are already used in production. The main appeal of post-processing is that it avoids expensive retraining. In this work, we propose general post-processing algorithms for individual fairness (IF). We consider a setting where the learner only has access to the predictions of the original model and a similarity graph between individuals, guiding the desired fairness constraints. We cast the IF post-processing problem as a graph smoothing problem corresponding to graph Laplacian regularization that preserves the desired "treat similar individuals similarly" interpretation. Our theoretical results demonstrate the connection of the new objective function to a local relaxation of the original individual fairness. Empirically, our post-processing algorithms correct individual biases in large-scale NLP models such as BERT, while preserving accuracy.

Publication:

arXiv e-prints

Pub Date:

October 2021

DOI:

10.48550/arXiv.2110.13796

arXiv:

arXiv:2110.13796

Bibcode:

2021arXiv211013796P

Keywords:

Statistics - Machine Learning;
Computer Science - Machine Learning

E-Print:

Published at NeurIPS 2021, Code @ https://github.com/Felix-Petersen/fairness-post-processing, Video @ https://www.youtube.com/watch?v=9PyKODDewPA

NASA/ADS

Post-processing for Individual Fairness

Abstract