June 16, 2014

Neural Network Bottleneck Features for Language Identification

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

This paper presents the application of Neural Network Bot-tleneck (BN) features in Language Identification (LID). BN fea-tures are generally used for Large Vocabulary Speech Recogni-tion in conjunction with conventional acoustic features, such as MFCC or PLP. We compare the BN features to several common types of acoustic features used in the state-of-the-art LID sys-tems. The test set is from DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state-of-the-art detection capabilities on audio from highly de-graded radio communication channels. On this type of noisy data, we show that in average, the BN features provide a 45% relative improvement in the Cavgor Equal Error Rate (EER) metrics across several test duration conditions, with respect to our single best acoustic features. Index Terms: language identification, noisy speech, robust feature extraction

Preguntar a la IA

Me gusta

Guardar

Cite This Study

Matějka et al. (Mon,) studied this question.

synapsesocial.com/papers/6a2009c935281a23f90dd4e1 https://doi.org/https://doi.org/10.21437/odyssey.2014-45

Preguntar a la IA

Me gusta

Guardar