July 1, 1999

Structure Learning in Conditional Probability Models via an Entropic Prior and Parameter Extinction

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

We introduce an entropic prior for multinomial parameter estimation problems and solve for its maximum a posteriori (MAP) estimator. The prior is a bias for maximally structured and minimally ambiguous models. In conditional probability models with hidden state, iterative MAP estimation drives weakly supported parameters toward extinction, effectively turning them off. Thus, structure discovery is folded into parameter estimation. We then establish criteria for simplifying a probabilistic model's graphical structure by trimming parameters and states, with a guarantee that any such deletion will increase the posterior probability of the model. Trimming accelerates learning by sparsifying the model. All operations monotonically and maximally increase the posterior probability, yielding structure-learning algorithms only slightly slower than parameter estimation via expectation-maximization and orders of magnitude faster than search-based structure induction. When applied to hidden Markov model training, the resulting models show superior generalization to held-out test data. In many cases the resulting models are so sparse and concise that they are interpretable, with hidden states that strongly correlate with meaningful categories.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Cite This Study

Matthew Brand (Thu,) studied this question.

synapsesocial.com/papers/6a15644479ff98d0de4e8ff0 https://doi.org/https://doi.org/10.1162/089976699300016395

Discussion

Journals

Neural Computation

Institutions

Mitsubishi Electric (United States)

References and Citations

Add This Paper to Your Research Feed

Any time a new paper drops it will be there.