May 1, 2014

Improving DNN speaker independence with I-vector inputs

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

We propose providing additional utterance-level features as inputs to a deep neural network (DNN) to facilitate speaker, channel and background normalization. Modifications of the basic algorithm are developed which result in significant reductions in word error rates (WERs). The algorithms are shown to combine well with speaker adaptation by backpropagation, resulting in a 9% relative WER reduction. We address implementation of the algorithm for a streaming task.

Preguntar a la IA

Me gusta

Guardar

Cite This Study

Senior et al. (Thu,) studied this question.

synapsesocial.com/papers/6a16af301375058a2905307c https://doi.org/https://doi.org/10.1109/icassp.2014.6853591

Preguntar a la IA

Me gusta

Guardar