August 25, 2013Open Access

Sequence-discriminative training of deep neural networks

Key Points

Key points are not available for this paper at this time.

Abstract

Sequence-discriminative training of deep neural networks (DNNs) is investigated on a 300 hour American English conversational telephone speech task. Different sequence-discriminative criteria ndash;- maximum mutual information (MMI), minimum phone error (MPE), state-level minimum Bayes risk (sMBR), and boosted MMI ndash;- are compared. Two different heuristics are investigated to improve the performance of the DNNs trained using sequence-based criteria ndash;- lattices are re-generated after the first iteration of training; and, for MMI and BMMI, the frames where the numerator and denominator hypotheses are disjoint are removed from the gradient computation. Starting from a competitive DNN baseline trained using cross-entropy, different sequence-discriminative criteria are shown to lower word error rates by 8-9% relative, on average. Little difference is noticed between the different sequence-based criteria that are investigated. The experiments are done using the open-source Kaldi toolkit, which makes it possible for the wider community to reproduce these results.

Bookmark

View Full Paper

Cite This Study

Veselý et al. (Sun,) studied this question.

synapsesocial.com/papers/6a0ea3d9aa1655e5fb2297eb https://doi.org/https://doi.org/10.21437/interspeech.2013-548

Bookmark

View Full Paper