Discrete utterance speech recognition without time normalization

Key Points

Key points are not available for this paper at this time.

Abstract

We present a new, fast method for discrete utterance recognition of telephone bandwidth speech. The method is based on speech coding by vector quantization and minimum cross-entropy pattern classification. Separate vector quantization codebooks are designed from training sequences for each word in the recognition vocabulary. Inputs from outside the training sequence are classified by performing vector quantization and finding the codebook that achieves the lowest average distortion per speech frame. The new method obviates time normalization and uses approximately 6000 bits to represent each utterance in the recognition vocabulary. Preliminary limited testing on speaker dependent digit recognition has demonstrated excellent performance. Detailed tests are now in progress.

Bookmark

Cite This Study

Shore et al. (Thu,) studied this question.

synapsesocial.com/papers/6a1283608edbaba0bf676b68 https://doi.org/https://doi.org/10.1109/icassp.1982.1171884

Bookmark