Fast Few-shot Debugging for NLU Test Suites
Abstract
We study few-shot debugging of transformer based natural language understanding models, using recently popularized test suites to not just diagnose but correct a problem. Given a few debugging examples of a certain phenomenon, and a held-out test set of the same phenomenon, we aim to maximize accuracy on the phenomenon at a minimal cost of accuracy on the original test set. We examine several methods that are faster than full epoch retraining. We introduce a new fast method, which samples a few in-danger examples from the original training set. Compared to fast methods using parameter distance constraints or Kullback-Leibler divergence, we achieve superior original accuracy for comparable debugging accuracy.
- Publication:
-
arXiv e-prints
- Pub Date:
- April 2022
- DOI:
- 10.48550/arXiv.2204.06555
- arXiv:
- arXiv:2204.06555
- Bibcode:
- 2022arXiv220406555M
- Keywords:
-
- Computer Science - Computation and Language
- E-Print:
- To appear at ACL 2022 Deep Learning Inside Out (DeeLIO) workshop