December 1, 2016Open Access

A Latent Variable Model Approach to PMI-based Word Embeddings

Key Points

Key points are not available for this paper at this time.

Abstract

Semantic word embeddings represent the meaning of a word via a vector, and are created by diverse methods. Many use nonlinear operations on co-occurrence statistics, and have hand-tuned hyperparameters and reweighting methods. This paper proposes a new generative model, a dynamic version of the log-linear topic model of Mnih and Hinton (2007). The methodological novelty is to use the prior to compute closed form expressions for word statistics. This provides a theoretical justification for nonlinear models like PMI, word2vec, and GloVe, as well as some hyperparameter choices. It also helps explain why low-dimensional semantic embeddings contain linear algebraic structure that allows solution of word analogies, as shown by Mikolov et al. (2013a) and many subsequent papers. Experimental support is provided for the generative model assumptions, the most important of which is that latent word vectors are fairly uniformly dispersed in space.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Sanjeev Arora

Radiology Associates of Albuquerque

Yuanzhi Li

University of Science and Technology of China

Yingyu Liang

University of Wisconsin–Madison

Journals

Transactions of the Association for Computational Linguistics

Actions

Institutions

Princeton University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

A Latent Variable Model Approach to PMI-based Word Embeddings

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study