Continually Improving Extractive QA via Human Feedback

doi:10.48550/arXiv.2305.12473

Continually Improving Extractive QA via Human Feedback

We study continually improving an extractive question answering (QA) system via human user feedback. We design and deploy an iterative approach, where information-seeking users ask questions, receive model-predicted answers, and provide feedback. We conduct experiments involving thousands of user interactions under diverse setups to broaden the understanding of learning from feedback over time. Our experiments show effective improvement from user feedback of extractive QA models over time across different data regimes, including significant potential for domain adaptation.

Publication:

arXiv e-prints

Pub Date:

May 2023

DOI:

10.48550/arXiv.2305.12473

arXiv:

arXiv:2305.12473

Bibcode:

2023arXiv230512473G

Keywords:

Computer Science - Computation and Language;
Computer Science - Artificial Intelligence;
Computer Science - Machine Learning

E-Print:

EMNLP 2023

NASA/ADS

Continually Improving Extractive QA via Human Feedback

Abstract