What question did this study set out to answer?

July 4, 2026Open Access

Mathematical Foundations of Compositional Language Models

Key Points

This work aims to clarify the mathematical requirements for compositionality in language models.
Examined paradigms from n-grams to deep neural architectures through homomorphisms.
Proposed formalizing learning as an inverse problem using representation theory.
Suggested leveraging geometric machine learning and algebraic structures to enhance compositional generalization.
Identified limitations of existing models converge on the curse of dimensionality.
Demonstrated that learning compositionality is an ill-posed inverse problem.
Proposed the use of symmetries in representation theory to constrain hypothesis space for better model performance.

Abstract

Large Language Models exhibit impressive linguistic competence, yet this has given rise to a foundationaldebate as to whether their capacity for generalization stems from human-like systematicity or mere stochastic parroting.Central to this is the principle of compositionality. This survey clarifies the mathematical requirements forcompositionality by examining major paradigms—from n-grams to deep neural architectures—through the lens ofhomomorphisms between syntactic and semantic algebras. We demonstrate that the limitations of these modelsconverge to the curse of dimensionality, rendering the learning of compositionality an inherently ill-posed inverseproblem. To address this, we propose formalizing learning as an inverse problem in representation theory, where linguisticsymmetries act as essential regularization to constrain the hypothesis space. We further suggest that geometricmachine learning, leveraging these symmetries as inductive biases, offers a novel mechanism, potentially utilizingmathematical formulations like Clifford algebra, where the establishment of homomorphisms serves as a critical indicatorof compositional generalization. This framework redirects the research focus toward elucidating the algebraicstructure of language itself.

KI fragen

Bookmark

View Full Paper