Speech animation using electromagnetic articulography as motion capture data
Abstract
Electromagnetic articulography (EMA) captures the position and orientation of a number of markers, attached to the articulators, during speech. As such, it performs the same function for speech that conventional motion capture does for full-body movements acquired with optical modalities, a long-time staple technique of the animation industry. In this paper, EMA data is processed from a motion-capture perspective and applied to the visualization of an existing multimodal corpus of articulatory data, creating a kinematic 3D model of the tongue and teeth by adapting a conventional motion capture based animation paradigm. This is accomplished using off-the-shelf, open-source software. Such an animated model can then be easily integrated into multimedia applications as a digital asset, allowing the analysis of speech production in an intuitive and accessible manner. The processing of the EMA data, its co-registration with 3D data from vocal tract magnetic resonance imaging (MRI) and dental scans, and the modeling workflow are presented in detail, and several issues discussed.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2013
- DOI:
- 10.48550/arXiv.1310.8585
- arXiv:
- arXiv:1310.8585
- Bibcode:
- 2013arXiv1310.8585S
- Keywords:
-
- Computer Science - Human-Computer Interaction;
- Quantitative Biology - Quantitative Methods
- E-Print:
- AVSP - 12th International Conference on Auditory-Visual Speech Processing - 2013 (2013) 55-60