Key points are not available for this paper at this time.
When building applications from large vocabulary speech recognition systems, a certain amount of search errors due to pruning often has to be accepted in order to obtain the required speed. We tackle the problems resulting from aggressive pruning strategies as typically applied in large vocabulary systems to achieve close to real-time performance. We consider a typical scenario of a two pass Viterbi search with the first pass being organized as a phoneme (allophone) tree. For such a tree organized lexicon, there are two possibilities to use a bigram language model: either by building tree copies or by using so-called delayed bigrams. Since copying trees turns out to be too expensive for real time applications we basically refer to delayed bigrams, discuss their drastic influence on the word accuracy and show how to alleviate the disastrous effect of delayed bigrams under aggressive pruning.
Building similarity graph...
Analyzing shared references across papers
Loading...
Monika Woszczyna
Karlsruhe Institute of Technology
Michael Finke
Deutsches Zentrum für Luft- und Raumfahrt e. V. (DLR)
Carnegie Mellon University
Building similarity graph...
Analyzing shared references across papers
Loading...
Woszczyna et al. (Tue,) studied this question.
synapsesocial.com/papers/6a204d314ad5e85db1e71ae2 — DOI: https://doi.org/10.1109/icassp.1996.540309