August 27, 2007

PocketSUMMIT: small-footprint continuous speech recognition

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

We present PocketSUMMIT, a small-footprint version of our SUMMIT continuous speech recognition system. With portable devices becoming smaller and more powerful, speech is increasingly becoming an important input modality on these devices. PocketSUMMIT is implemented as a variable-rate continuous density hidden Markov model with diphone context-dependent models. We explore various Gaussian parameter quantization schemes and find 8:1 compression or more is achievable with little reduction in accuracy. We also show how the quantized parameters can be used for rapid table lookup. We explore firstpass language model pruning in a finite-state transducer (FST) framework, as well as FST and n-gram weight quantization and bit packing, to further reduce memory usage. PocketSUMMIT is currently able to run a moderate vocabulary conversational speech recognition system in real time in a few MB on current PDAs and smart phones. Index Terms: speech recognition, small footprint, parameter quantization, finite-state transducer

Me gusta

Guardar

Cite This Study

I. Lee Hetherington (Mon,) studied this question.

synapsesocial.com/papers/6a201778349f479269fbdee2 https://doi.org/https://doi.org/10.21437/interspeech.2007-425

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Me gusta

Guardar