Key points are not available for this paper at this time.
The choice of method for training a speech recognizer is posed as an optimization problem. The currently used method of maximum likelihood, while heuristic, is shown to be superior under certain assumptions to another heuristic: the method of conditional maximum likelihood.
Arthur Nádas (Mon,) studied this question.