Understanding Intrinsic Robustness Using Label Uncertainty

doi:10.48550/arXiv.2107.03250

Understanding Intrinsic Robustness Using Label Uncertainty

A fundamental question in adversarial machine learning is whether a robust classifier exists for a given task. A line of research has made some progress towards this goal by studying the concentration of measure, but we argue standard concentration fails to fully characterize the intrinsic robustness of a classification problem since it ignores data labels which are essential to any classification task. Building on a novel definition of label uncertainty, we empirically demonstrate that error regions induced by state-of-the-art models tend to have much higher label uncertainty than randomly-selected subsets. This observation motivates us to adapt a concentration estimation algorithm to account for label uncertainty, resulting in more accurate intrinsic robustness measures for benchmark image classification problems.

Publication:

arXiv e-prints

Pub Date:

July 2021

DOI:

10.48550/arXiv.2107.03250

arXiv:

arXiv:2107.03250

Bibcode:

2021arXiv210703250Z

Keywords:

Computer Science - Machine Learning;
Computer Science - Cryptography and Security

E-Print:

ICLR 2022

NASA/ADS

Understanding Intrinsic Robustness Using Label Uncertainty

Abstract