Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition

doi:10.48550/arXiv.2110.02220

Fast contextual adaptation has been shown to be effective in improving Automatic Speech Recognition (ASR) of rare words, and when combined with on-device personalized training, it can yield even better recognition results. However, traditional re-scoring approaches based on an external language model are prone to diverge during personalized training. In this work, we introduce a model-based end-to-end contextual adaptation approach that is decoder-agnostic and amenable to on-device personalization. Our on-device simulation experiments demonstrate that the proposed approach outperforms the traditional re-scoring technique by 12% relative WER and 15.7% entity-mention-specific F1-score in a continuous personalization scenario.

Publication:

arXiv e-prints

Pub Date:

October 2021

DOI:

10.48550/arXiv.2110.02220

arXiv:

arXiv:2110.02220

Bibcode:

2021arXiv211002220M

Keywords:

Electrical Engineering and Systems Science - Audio and Speech Processing;
Computer Science - Artificial Intelligence;
Computer Science - Computation and Language;
Computer Science - Machine Learning;
Computer Science - Neural and Evolutionary Computing

E-Print:

5 pages, 3 figures, 3 tables

NASA/ADS
