Reliability Testing for Natural Language Processing Systems

doi:10.48550/arXiv.2105.02590

Reliability Testing for Natural Language Processing Systems

Questions of fairness, robustness, and transparency are paramount to address before deploying NLP systems. Central to these concerns is the question of reliability: Can NLP systems reliably treat different demographics fairly and function correctly in diverse and noisy environments? To address this, we argue for the need for reliability testing and contextualize it among existing work on improving accountability. We show how adversarial attacks can be reframed for this goal, via a framework for developing reliability tests. We argue that reliability testing -- with an emphasis on interdisciplinary collaboration -- will enable rigorous and targeted testing, and aid in the enactment and enforcement of industry standards.

Publication:

arXiv e-prints

Pub Date:

May 2021

DOI:

10.48550/arXiv.2105.02590

arXiv:

arXiv:2105.02590

Bibcode:

2021arXiv210502590T

Keywords:

Computer Science - Machine Learning;
Computer Science - Artificial Intelligence;
Computer Science - Computation and Language;
Computer Science - Computers and Society;
Computer Science - Neural and Evolutionary Computing

E-Print:

Accepted to ACL-IJCNLP 2021 (main conference). Camera-ready version

NASA/ADS

Reliability Testing for Natural Language Processing Systems

Abstract