November 30, 1998

Unsupervised training of a speech recognizer using TV broadcasts

Key Points

Key points are not available for this paper at this time.

Abstract

Current speech recognition systems require large amounts of transcribed data for parameter estimation. The transcription, however, is tedious and expensive. In this work we describe our experiments which are aimed at training a speech recognizer without transcriptions. The experiments were carried out with TV newscasts, that were recorded using a satellite receiver and a simple MPEG coding hardware. The newscasts were automatically segmented into segments of similar acoustic background condition. This material is inexpensive and can be made available in large quantities, but there are no transcriptions available. We develop a training scheme, where a recognizer is bootstrapped using very little transcribed data and is improved using new, untranscribed speech. We show that it is necessary to use a confidence measure to judge the initial transcriptions of the recognizer before using them. Higher improvements can be achieved if the number of parameters in the system is increased when more...

KI fragen

Bookmark

Cite This Study

Kemp et al. (Mon,) studied this question.

synapsesocial.com/papers/6a1c6b202cc291e7bf2fc0a8 https://doi.org/https://doi.org/10.21437/icslp.1998-632

KI fragen

Bookmark