Improved robustness to adversarial examples using Lipschitz regularization of the loss

doi:10.48550/arXiv.1810.00953

We augment adversarial training (AT) with worst-case adversarial training (WCAT), which improves adversarial robustness by 11% over the current state-of-the-art result in the $\ell_2$ norm on CIFAR-10. We obtain verifiable average-case and worst-case robustness guarantees based on the expected and maximum values of the norm of the gradient of the loss. We interpret adversarial training as total variation regularization, a fundamental tool in mathematical image processing, and WCAT as Lipschitz regularization.
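The two quantities the abstract names, the expected and the maximum norm of the gradient of the loss with respect to the input, can be illustrated on a toy problem. The sketch below is not the paper's method or code: it assumes a linear logistic model (so the input gradient has a closed form) rather than a deep network, and the function names `input_gradient_norms` and `gradient_norm_statistics`, the random data, and the weights are all hypothetical choices made for illustration.

```python
import numpy as np


def input_gradient_norms(w, b, X, y):
    """l2 norm of d(loss)/dx per sample, for the logistic loss
    log(1 + exp(-y * (w.x + b))) with labels y in {-1, +1}.

    Assumption: a linear model, chosen only so the gradient is analytic.
    """
    margins = y * (X @ w + b)
    # d(loss)/dx = -y * sigmoid(-margin) * w, so its l2 norm is
    # sigmoid(-margin) * ||w||_2 (|y| = 1).
    slope = 1.0 / (1.0 + np.exp(margins))
    return slope * np.linalg.norm(w)


def gradient_norm_statistics(w, b, X, y):
    """Return (mean, max) of the per-sample input-gradient norms.

    The mean is the average-case quantity and the max the worst-case
    quantity, evaluated here on a finite batch only.
    """
    norms = input_gradient_norms(w, b, X, y)
    return norms.mean(), norms.max()


# Hypothetical toy data and parameters.
rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))
y = np.where(rng.random(8) < 0.5, -1.0, 1.0)
w = np.array([0.5, -1.0, 2.0])
b = 0.1

avg_norm, max_norm = gradient_norm_statistics(w, b, X, y)
print(f"mean input-gradient norm: {avg_norm:.4f}")
print(f"max  input-gradient norm: {max_norm:.4f}")
```

In this toy setting, penalizing the mean would correspond to the average-case (total variation style) view and penalizing the max to the worst-case (Lipschitz style) view described above. Note that a maximum taken over a batch only bounds the gradient norm from below; it is not a certified Lipschitz constant of the loss.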

Publication: arXiv e-prints

Pub Date: October 2018

DOI: 10.48550/arXiv.1810.00953

arXiv: arXiv:1810.00953

Bibcode: 2018arXiv181000953F

Keywords: Computer Science - Machine Learning; Computer Science - Cryptography and Security; Computer Science - Computer Vision and Pattern Recognition; Statistics - Machine Learning

E-Print: Merged with arXiv:1808.09540
