Adversarial Consistency and the Uniqueness of the Adversarial Bayes Classifier

doi:10.48550/arXiv.2404.17358

Adversarial Consistency and the Uniqueness of the Adversarial Bayes Classifier

Frank, Natalie S.

Adversarial training is a common technique for learning robust classifiers. Prior work showed that convex surrogate losses are not statistically consistent in the adversarial context -- or in other words, a minimizing sequence of the adversarial surrogate risk will not necessarily minimize the adversarial classification error. We connect the consistency of adversarial surrogate losses to properties of minimizers to the adversarial classification risk, known as \emph{adversarial Bayes classifiers}. Specifically, under reasonable distributional assumptions, a convex loss is statistically consistent for adversarial learning iff the adversarial Bayes classifier satisfies a certain notion of uniqueness.

Publication:

arXiv e-prints

Pub Date:

April 2024

DOI:

10.48550/arXiv.2404.17358

arXiv:

arXiv:2404.17358

Bibcode:

2024arXiv240417358F

Keywords:

Computer Science - Machine Learning;
Mathematics - Statistics Theory;
Statistics - Machine Learning

E-Print:

18 pages, v2: fixed typos

NASA/ADS

Adversarial Consistency and the Uniqueness of the Adversarial Bayes Classifier

Abstract