a Model for the Synthesis of Natural Sounding Vowels.
Abstract
A model has been developed which is designed to preserve some of the naturalness that is usually lost in speech synthesis. A parameterized function is used to produce an approximation to the cross-sectional area through the glottis. A circuit model of the subglottal and glottal system is used with the supraglottal pressure to generate the glottal volume-velocity. The tract used to obtain the supraglottal pressure is represented by its input-impedance impulse-response which can be calculated from the area function of the tract. A convolution of the input-impedance impulse-response with the volume velocity determines the supraglottal pressure. The two coupled equations for the volume velocity are solved simultaneously. The output of the model is generated by convolving the resulting glottal volume-velocity with the transfer-function impulse-response of the tract. This technique preserves the interaction between the glottal flow and the vocal tract which is usually lost. A comparison is made between vowels synthesized with and without the vocal-tract, glottal-flow interaction. Listening tests showed that vowels synthesized with the interaction were preferred as more natural sounding than those without the interaction.
- Publication:
-
Ph.D. Thesis
- Pub Date:
- 1983
- Bibcode:
- 1983PhDT........52A
- Keywords:
-
- Physics: Acoustics