Discovering Boundary Values of Feature-based Machine Learning Classifiers through Exploratory Datamorphic Testing
Abstract
Testing has been widely recognised as difficult for AI applications. This paper proposes a set of testing strategies for testing machine learning applications in the framework of the datamorphism testing methodology. In these strategies, testing aims at exploring the data space of a classification or clustering application to discover the boundaries between classes that the machine learning application defines. This enables the tester to understand precisely the behaviour and function of the software under test. In the paper, three variants of exploratory strategies are presented with the algorithms implemented in the automated datamorphic testing tool Morphy. The correctness of these algorithms is formally proved. Their capability and cost of discovering borders between classes are evaluated via a set of controlled experiments with manually designed subjects and a set of case studies with real machine learning models.
- Publication: arXiv e-prints
- Pub Date: October 2021
- DOI: 10.48550/arXiv.2110.00330
- arXiv: arXiv:2110.00330
- Bibcode: 2021arXiv211000330Z
- Keywords: Computer Science - Machine Learning; Computer Science - Software Engineering
- E-Print: Accepted on 20 January 2022 for publication in the Journal of Systems and Software