Provenance: A Light-weight Fact-checker for Retrieval Augmented LLM Generation Output

doi:10.48550/arXiv.2411.01022

Provenance: A Light-weight Fact-checker for Retrieval Augmented LLM Generation Output

We present a light-weight approach for detecting nonfactual outputs from retrieval-augmented generation (RAG). Given a context and putative output, we compute a factuality score that can be thresholded to yield a binary decision to check the results of LLM-based question-answering, summarization, or other systems. Unlike factuality checkers that themselves rely on LLMs, we use compact, open-source natural language inference (NLI) models that yield a freely accessible solution with low latency and low cost at run-time, and no need for LLM fine-tuning. The approach also enables downstream mitigation and correction of hallucinations, by tracing them back to specific context chunks. Our experiments show high area under the ROC curve (AUC) across a wide range of relevant open source datasets, indicating the effectiveness of our method for fact-checking RAG output.

Publication:

arXiv e-prints

Pub Date:

November 2024

DOI:

10.48550/arXiv.2411.01022

arXiv:

arXiv:2411.01022

Bibcode:

2024arXiv241101022S

Keywords:

Computer Science - Computation and Language

E-Print:

To appear in Proceedings of EMNLP 2024 Industry Track

NASA/ADS

Provenance: A Light-weight Fact-checker for Retrieval Augmented LLM Generation Output

Abstract