August 1, 1984

Estimation of probabilities in the language model of the IBM speech recognition system

Key Points

Key points are not available for this paper at this time.

Abstract

The language model probabilities are estimated by an empirical Bayes approach in which a prior distribution for the unknown probabilities is itself estimated through a novel choice of data. The predictive power of the model thus fitted is compared by means of its experimental perplexity 1 to the model as fitted by the Jelinek-Mercer deleted estimator and as fitted by the Turing-Good formulas for probabilities of unseen or rarely seen events.

Mark Helpful

Bookmark

Relay

Cite This Study

Arthur Nádas (Wed,) studied this question.

synapsesocial.com/papers/6a10a2ae01be78fe8161252a https://doi.org/https://doi.org/10.1109/tassp.1984.1164378

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

1THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS1953 · 3,312 citations
2Applied Statistical Decision Theory1962 · 2,190 citations
3Hidden Markov chains, the forward-backward algorithm, and initial statistics1983 · 17 citations
4Prediction and Entropy of Printed English1951 · 2,696 citations
5The Population Frequencies of Species and the Estimation of Population Parameters1953 · 448 citations

Mark Helpful

Bookmark

Relay