In Search of Autocorrelation Based Vocal Cord Cues for Speaker Identification
Abstract
In this paper we investigate a technique to find out vocal source based features from the LP residual of speech signal for automatic speaker identification. Autocorrelation with some specific lag is computed for the residual signal to derive these features. Compared to traditional features like MFCC, PLPCC which represent vocal tract information, these features represent complementary vocal cord information. Our experiment in fusing these two sources of information in representing speaker characteristics yield better speaker identification accuracy. We have used Gaussian mixture model (GMM) based speaker modeling and results are shown on two public databases to validate our proposition.
- Publication:
-
arXiv e-prints
- Pub Date:
- May 2011
- DOI:
- 10.48550/arXiv.1105.2095
- arXiv:
- arXiv:1105.2095
- Bibcode:
- 2011arXiv1105.2095S
- Keywords:
-
- Computer Science - Human-Computer Interaction
- E-Print:
- Proceedings of 2nd International Conference on RF &