Key points are not available for this paper at this time.
We present an estimate of an upper bound of 1.75 bits for the entropy of characters in printed English, obtained by constructing a word trigram model and then computing the cross-entropy between this model and a balanced sample of English text. We suggest the well-known and widely available Brown Corpus of printed English as a standard against which to measure progress in language modeling and offer our bound as the first of what we hope will be a series of steadily decreasing bounds.
Building similarity graph...
Analyzing shared references across papers
Loading...
Peter F. Brown
IBM (United States)
Vincent J. Della Pietra
IBM (United States)
Robert L. Mercer
IBM (United States)
Computational Linguistics
IBM (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Brown et al. (Sun,) studied this question.
synapsesocial.com/papers/6a0f8a05d13714ec96fe4652 — DOI: https://doi.org/10.5555/146680.146685
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: