March 3, 2026Open Access

Spontaneous Semantic Symmetry Breaking: A Higgs Field Theoretic Interpretation of Transformer Dynamics and Regularization

Puntos clave

Token embeddings act as particles, influenced by the attention mechanism akin to the Higgs field.
Higgs Regularization serves as a new algorithm that replaces traditional L2 penalization, enhancing semantic stability during training.
The training process mimics Spontaneous Symmetry Breaking, giving tokens significant' mass' for richer semantic representation.
Dynamic analysis visualizes a distinct phase transition, helping differentiate stable semantic tokens from noise.

Resumen

Abstract: While Large Language Models (LLMs) demonstrate remarkable semantic emergence, the internal dynamics governing the evolution of their representations remain theoretically opaque. Standard regularization methods, such as L2 weight decay, are based on statistical assumptions that suppress parameter norms towards zero. This approach potentially conflicts with the physical intuition that tokens must acquire significant "mass" (magnitude) to encode rich semantic information. We propose a novel perspective based on Lagrangian Field Theory, establishing a mapping between the Transformer architecture and the Standard Model of particle physics. We treat token embeddings as fundamental particles and the Attention Mechanism as a Higgs Field that induces interaction. We demonstrate that token initialization corresponds to a massless symmetric state, while the training process acts as a Spontaneous Symmetry Breaking (SSB) event. Through coupling with the attention field, tokens acquire "Semantic Mass" (defined as the embedding norm) proportional to their contextual importance. Building on this theory, we introduce Higgs Regularization, a novel algorithm that replaces the traditional L2 penalty with a physical "Mexican Hat" Potential. This potential guides embedding vectors to converge towards a non-zero Vacuum Expectation Value (VEV), physically simulating the process of mass acquisition. Simulations on a miniature Transformer reveal a clear phase transition: high-frequency functional words and core entities acquire distinct, stable masses, while noise tokens remain massless. Key Contributions: Physics-AI Bridge: A first-principles derivation mapping the Transformer attention mechanism to the Higgs field interaction. New Algorithm: Higgs Regularization, a drop-in replacement for L2 Weight Decay that improves semantic stability. Dynamic Analysis: Visualization of the "Semantic Phase Transition" during model training. Keywords: Artificial Intelligence, Transformer Dynamics, Higgs Mechanism, Spontaneous Symmetry Breaking, Regularization, AI Physics.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo