April 1, 1977

Normalization of vowels by vocal-tract length and its application to vowel identification

Abstract

A new approach to speech parameter normalization is presented in which no prior knowledge about the input speakers is required. The vocal-tract length and area function are first estimated from the acoustic speech waveform, and then the area function is normalized to an acoustic tube of the same shape having a certain reference length. The normalized formant frequencies are defined as the resonance frequencies of this acoustic tube. The distributions of unnormalized and normalized formant frequencies for 9 stationary American vowels were investigated with 14 male and 12 female speakers. Fairly compact distributions of the vowels in the normalized F1-F2-F3 space were obtained. A preliminary identification test for stationary vowels based on this normalization method showed an expected average recognition rate of 84 to 96 percent for arbitrarily selected speakers, depending on the phonetic criteria adopted for defining "correct" identification.
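
The length-normalization step can be illustrated with a minimal sketch. It assumes the vocal-tract length has already been estimated (the abstract does not detail that procedure, and it is not reproduced here) and that the tract behaves as a lossless tube, in which case stretching the tube to a reference length while keeping its shape scales every resonance by the ratio of the two lengths. The function name `normalize_formants` and the 17.5 cm default reference length are hypothetical choices for illustration, not values taken from the paper.

```python
from typing import List, Sequence


def normalize_formants(
    formants_hz: Sequence[float],
    tract_length_cm: float,
    reference_length_cm: float = 17.5,  # hypothetical reference, not from the paper
) -> List[float]:
    """Rescale formants to a tube of the same shape with a reference length.

    Assumes a lossless acoustic tube: scaling the length by a factor k
    while preserving the area-function shape divides each resonance by k.
    """
    if tract_length_cm <= 0 or reference_length_cm <= 0:
        raise ValueError("lengths must be positive")
    # A shorter tract than the reference has higher formants, so they are
    # scaled down by (tract length / reference length), and vice versa.
    return [f * tract_length_cm / reference_length_cm for f in formants_hz]


if __name__ == "__main__":
    # Illustrative (made-up) formants for a speaker with a 14 cm tract.
    print(normalize_formants([500.0, 1500.0, 2500.0], tract_length_cm=14.0))
```

Under these assumptions, two speakers producing the same tract shape at different lengths map to the same normalized formants, which is the property the compact F1-F2-F3 distributions rely on.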

Cite This Study

Hisashi Wakita (April 1, 1977). Normalization of vowels by vocal-tract length and its application to vowel identification.

synapsesocial.com/papers/6a212b731311b8b97096a035 https://doi.org/10.1109/tassp.1977.1162929
