Unrestricted Adversarial Examples

doi:10.48550/arXiv.1809.08352

Unrestricted Adversarial Examples

We introduce a two-player contest for evaluating the safety and robustness of machine learning systems, with a large prize pool. Unlike most prior work in ML robustness, which studies norm-constrained adversaries, we shift our focus to unconstrained adversaries. Defenders submit machine learning models, and try to achieve high accuracy and coverage on non-adversarial data while making no confident mistakes on adversarial inputs. Attackers try to subvert defenses by finding arbitrary unambiguous inputs where the model assigns an incorrect label with high confidence. We propose a simple unambiguous dataset ("bird-or- bicycle") to use as part of this contest. We hope this contest will help to more comprehensively evaluate the worst-case adversarial risk of machine learning models.

Publication:

arXiv e-prints

Pub Date:

September 2018

DOI:

10.48550/arXiv.1809.08352

arXiv:

arXiv:1809.08352

Bibcode:

2018arXiv180908352B

Keywords:

Statistics - Machine Learning;
Computer Science - Computer Vision and Pattern Recognition;
Computer Science - Machine Learning

NASA/ADS

Unrestricted Adversarial Examples

Abstract