AutoNLU: Detecting, root-causing, and fixing NLU model errors

doi:10.48550/arXiv.2110.06384

Improving the quality of Natural Language Understanding (NLU) models, and more specifically, task-oriented semantic parsing models, in production is a cumbersome task. In this work, we present a system called AutoNLU, which we designed to scale the NLU quality improvement process. It adds automation to three key steps: detection, attribution, and correction of model errors, i.e., bugs. We detected four times more failed tasks than with random sampling, finding that even a simple active learning sampling method on an uncalibrated model is surprisingly effective for this purpose. The AutoNLU tool empowered linguists to fix ten times more semantic parsing bugs than with prior manual processes, auto-correcting 65% of all identified bugs.
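The abstract does not say which active learning sampling method was used for detection. As a rough illustration only, the sketch below assumes least-confidence sampling, one common simple choice: each prediction is scored by one minus its top class probability, and the highest-scoring items are sent for review first. The function names, the `budget` parameter, and the example probabilities are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def least_confidence_scores(probabilities: Sequence[Sequence[float]]) -> List[float]:
    """Return 1 - max class probability per prediction (higher = less confident)."""
    return [1.0 - max(p) for p in probabilities]


def select_for_review(probabilities: Sequence[Sequence[float]], budget: int) -> List[int]:
    """Return the indices of the `budget` least confident predictions."""
    scores = least_confidence_scores(probabilities)
    # Rank indices from least confident to most confident.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:budget]


# Hypothetical model outputs: one probability distribution per utterance.
# These numbers are made up for illustration; no calibration is applied.
example_probs = [
    [0.98, 0.01, 0.01],
    [0.40, 0.35, 0.25],
    [0.70, 0.20, 0.10],
    [0.34, 0.33, 0.33],
]

selected = select_for_review(example_probs, budget=2)
print(selected)
```

Ranking only needs the relative order of the scores, which is one plausible reason such a method could remain useful even when the model's probabilities are uncalibrated.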

Publication: arXiv e-prints
Pub Date: October 2021
DOI: 10.48550/arXiv.2110.06384
arXiv: arXiv:2110.06384
Bibcode: 2021arXiv211006384S
Keywords: Computer Science - Computation and Language; Computer Science - Machine Learning; I.2.7
E-Print: 8 pages, 5 figures

Abstract