What question did this study set out to answer?

To propose a geometric framework that interprets language as transformations instead of sequences, enhancing understanding of large language models.

June 17, 2026Open Access

Language Geometry: A Unified Geometric Framework forUnderstanding Large Language Models

Key Points

To propose a geometric framework that interprets language as transformations instead of sequences, enhancing understanding of large language models.
Conducted seven tests with synthetic data, GloVe, and BERT to validate core propositions.
Reinterpreted key components of language models using geometric principles like principal bundles and frame transformations.
Developed visualization tools to aid in understanding and validating large language models.
Demonstrated low-dimensional structure of difference vectors (2-5 dimensions).
Established that different syntactic relations correspond to distinct subspaces.
Showed emergence of non-linearity from twisted gluing of local sections.

Abstract

This paper presents a unified geometric framework for understanding the working mechanisms of large language models. The core thesis is: language is not a sequence of positions, but a sequence of transformations. We reinterpret the word embedding space asfibers of a principal bundle, attention mechanisms as frame transformations, and languagegeneration as probabilistic path sampling of difference vectors. Seven tests (synthetic data,GloVe, BERT) validate the core propositions: difference vectors have low-dimensional structure (2-5 dimensions), different syntactic relations correspond to different subspaces, localsections can glue under compatibility conditions, and non-linearity emerges from twistedgluing. The paper also elucidates backpropagation as a feature extractor that discoversbasic concepts from human cognition, and unifies induction and inference as two directionsof the same probabilistic mechanism, encoding and decoding as bidirectional mappings, andtranslation as different walking orders on the same geometric path. A suite of visualization tools is developed, offering a geometric perspective for understanding, debugging, andvalidating large language models.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Xiaobo Li (Tue,) studied this question.

synapsesocial.com/papers/6a323cc9d50b63ecad206dcd https://doi.org/https://doi.org/10.5281/zenodo.20709063

Bookmark

View Full Paper