Key points are not available for this paper at this time.
Today, most of the state-of-the-art speech recognizers are based on hidden Markov modeling. Using semi-continuous or continuous density hidden Markov models, the computation of emission probabilities requires the evaluation of mixture Gaussian probability density functions. Since it is very expensive to evaluate all the Gaussians of the mixture density codebook, many recognizers only compute the M most significant Gaussians (M=1,...,8). This paper presents an alternative approach to approximate mixture Gaussians with diagonal covariance matrices, based on a binary feature space partitioning tree. The proposed algorithm is experimentally evaluated in the context of large vocabulary, speaker independent, spontaneous speech recognition using the JANUS-2 speech recognizer. In the case of mixtures with 50 Gaussians, we achieve a speedup of 2-5 in the computation of HMM emission probabilities, without affecting the accuracy of the system.
Building similarity graph...
Analyzing shared references across papers
Loading...
Jürgen Fritsch
University Hospital Regensburg
Ivica Rogina
Carnegie Mellon University
Building similarity graph...
Analyzing shared references across papers
Loading...
Fritsch et al. (Tue,) studied this question.
synapsesocial.com/papers/6a204d314ad5e85db1e71ae3 — DOI: https://doi.org/10.1109/icassp.1996.543251
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: