Hidden Markov models in automatic speech recognition
Abstract
This article describes a method for constructing an automatic speech recognition system based on hidden Markov models (HMMs). The author discusses the basic concepts of HMM theory and the application of these models to the analysis and recognition of speech signals. The author provides algorithms which make it possible to train the ASR system and recognize signals on the basis of distinct stochastic models of selected speech sound classes. The author describes the specific components of the system and the procedures used to model and recognize speech. The author discusses problems associated with the choice of optimal signal detection and parameterization characteristics and their effect on the performance of the system. The author presents different options for the choice of speech signal segments and their consequences for the ASR process. The author gives special attention to the use of lexical, syntactic, and semantic information for the purpose of improving the quality and efficiency of the system. The author also describes an ASR system developed by the Speech Acoustics Laboratory of the IBPT PAS. The author discusses the results of experiments on the effect of noise on the performance of the ASR system and describes methods of constructing HMM's designed to operate in a noisy environment. The author also describes a language for human-robot communications which was defined as a complex multilevel network from an HMM model of speech sounds geared towards Polish inflections. The author also added mandatory lexical and syntactic rules to the system for its communications vocabulary.
- Publication:
-
NASA STI/Recon Technical Report N
- Pub Date:
- November 1993
- Bibcode:
- 1993STIN...9510239W
- Keywords:
-
- Algorithms;
- Human-Computer Interface;
- Markov Processes;
- Signal Analysis;
- Speech Recognition;
- Voice Control;
- Noise (Sound);
- Parameterization;
- Semantics;
- Signal Detection;
- Syntax;
- Communications and Radar