Universal Adversarial Perturbations for Speech Recognition Systems

doi:10.48550/arXiv.1905.03828

Universal Adversarial Perturbations for Speech Recognition Systems

In this work, we demonstrate the existence of universal adversarial audio perturbations that cause mis-transcription of audio signals by automatic speech recognition (ASR) systems. We propose an algorithm to find a single quasi-imperceptible perturbation, which when added to any arbitrary speech signal, will most likely fool the victim speech recognition model. Our experiments demonstrate the application of our proposed technique by crafting audio-agnostic universal perturbations for the state-of-the-art ASR system -- Mozilla DeepSpeech. Additionally, we show that such perturbations generalize to a significant extent across models that are not available during training, by performing a transferability test on a WaveNet based ASR system.

Publication:

arXiv e-prints

Pub Date:

May 2019

DOI:

10.48550/arXiv.1905.03828

arXiv:

arXiv:1905.03828

Bibcode:

2019arXiv190503828N

Keywords:

Computer Science - Machine Learning;
Computer Science - Sound;
Electrical Engineering and Systems Science - Audio and Speech Processing;
Statistics - Machine Learning

E-Print:

Published as a conference paper at INTERSPEECH 2019

NASA/ADS

Universal Adversarial Perturbations for Speech Recognition Systems

Abstract