LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases

doi:10.48550/arXiv.2501.03112

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases

Large Language Models (LLMs) have been observed to exhibit bias in numerous ways, potentially creating or worsening outcomes for specific groups identified by protected attributes such as sex, race, sexual orientation, or age. To help address this gap, we introduce LangFair, an open-source Python package that aims to equip LLM practitioners with the tools to evaluate bias and fairness risks relevant to their specific use cases. The package offers functionality to easily generate evaluation datasets, comprised of LLM responses to use-case-specific prompts, and subsequently calculate applicable metrics for the practitioner's use case. To guide in metric selection, LangFair offers an actionable decision framework.

Publication:

arXiv e-prints

Pub Date:

January 2025

DOI:

10.48550/arXiv.2501.03112

arXiv:

arXiv:2501.03112

Bibcode:

2025arXiv250103112B

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence;
Computer Science - Computers and Society;
Computer Science - Machine Learning

E-Print:

Journal of Open Source Software

ADS

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases

Abstract