When building applications from large-vocabulary speech recognition systems, a certain number of search errors due to pruning often has to be accepted in order to obtain the required speed. We tackle the problems that result from the aggressive pruning strategies typically applied in large-vocabulary systems to achieve close to real-time performance. We consider a typical scenario: a two-pass Viterbi search whose first pass is organized as a phoneme (allophone) tree. For such a tree-organized lexicon, there are two ways to use a bigram language model: either by building tree copies or by using so-called delayed bigrams. Since copying trees turns out to be too expensive for real-time applications, we focus on delayed bigrams, discuss their drastic influence on word accuracy, and show how to alleviate the disastrous effect of delayed bigrams under aggressive pruning.
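The interaction between delayed bigrams and beam pruning can be illustrated with a toy sketch. This is not the authors' implementation: the words, log scores, beam width, and the helper name `surviving` are all made-up assumptions chosen only to show the effect. With a delayed bigram, hypotheses inside the tree are compared on acoustic score alone, so a word that the bigram strongly favors can fall outside the beam before its language model score is ever applied. With tree copies, the predecessor word is known and the bigram score can be included before pruning.

```python
# Toy illustration (all values are invented for this sketch).

def surviving(hyps, beam):
    """Return the hypotheses whose log score lies within `beam` of the best."""
    best = max(hyps.values())
    return {word for word, score in hyps.items() if score >= best - beam}

# Hypothetical partial acoustic log scores after the predecessor word "new".
acoustic = {"fork": -10.0, "york": -13.0}

# Hypothetical bigram log probabilities.
bigram = {("new", "fork"): -7.0, ("new", "york"): -0.5}

beam = 2.5  # aggressive beam width (assumed)

# Delayed bigram: pruning inside the tree sees only the acoustic scores.
delayed = surviving(acoustic, beam)

# Tree copies: the bigram score is available before pruning.
with_bigram = {w: s + bigram[("new", w)] for w, s in acoustic.items()}
early = surviving(with_bigram, beam)

print("delayed bigram keeps:", delayed)
print("tree copies keep:   ", early)
```

In this toy setting the delayed-bigram search prunes "york" even though it is the better hypothesis once the bigram is taken into account, which is the kind of search error the abstract refers to.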
Carnegie Mellon University
Woszczyna et al. studied this question.