April 6, 2011

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition

Key Points

Key points are not available for this paper at this time.

Abstract

We propose a novel context-dependent (CD) model for large-vocabulary speech recognition (LVSR) that leverages recent advances in using deep belief networks for phone recognition. We describe a pre-trained deep neural network hidden Markov model (DNN-HMM) hybrid architecture that trains the DNN to produce a distribution over senones (tied triphone states) as its output. The deep belief network pre-training algorithm is a robust and often helpful way to initialize deep neural networks generatively that can aid in optimization and reduce generalization error. We illustrate the key components of our model, describe the procedure for applying CD-DNN-HMMs to LVSR, and analyze the effects of various modeling choices on performance. Experiments on a challenging business search dataset demonstrate that CD-DNN-HMMs can significantly outperform the conventional context-dependent Gaussian mixture model (GMM)-HMMs, with an absolute sentence accuracy improvement of 5.8% and 9.2% (or relative error reduction of 16.0% and 23.2%) over the CD-GMM-HMMs trained using the minimum phone error rate (MPE) and maximum-likelihood (ML) criteria, respectively.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

George E. Dahl

Swarthmore College

Dong Yu

Microsoft (United States)

Li Deng

Beijing University of Posts and Telecommunications

Journals

IEEE Transactions on Audio Speech and Language Processing

Actions

Institutions

University of Toronto

Microsoft (United States)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study