LangBiTe: A Platform for Testing Bias in Large Language Models

doi:10.48550/arXiv.2404.18558

LangBiTe: A Platform for Testing Bias in Large Language Models

The integration of Large Language Models (LLMs) into various software applications raises concerns about their potential biases. Typically, those models are trained on a vast amount of data scrapped from forums, websites, social media and other internet sources, which may instill harmful and discriminating behavior into the model. To address this issue, we present LangBiTe, a testing platform to systematically assess the presence of biases within an LLM. LangBiTe enables development teams to tailor their test scenarios, and automatically generate and execute the test cases according to a set of user-defined ethical requirements. Each test consists of a prompt fed into the LLM and a corresponding test oracle that scrutinizes the LLM's response for the identification of biases. LangBite provides users with the bias evaluation of LLMs, and end-to-end traceability between the initial ethical requirements and the insights obtained.

Publication:

arXiv e-prints

Pub Date:

April 2024

DOI:

10.48550/arXiv.2404.18558

arXiv:

arXiv:2404.18558

Bibcode:

2024arXiv240418558M

Keywords:

Computer Science - Software Engineering;
Computer Science - Artificial Intelligence

NASA/ADS

LangBiTe: A Platform for Testing Bias in Large Language Models

Abstract