What question did this study set out to answer?

This research aims to investigate geometric instability patterns in hallucination events across various large language models.

May 13, 2026 · Open Access

Structural Alignment of Semantic Collapse: Cross-Model Analysis of Hallucination Smoothing Dynamics

Key Points

Investigates geometric instability patterns in hallucination events across large language models.
Employs Gromov-Wasserstein distance for aligning hidden-state manifolds of Llama-3-8B, Mistral-7B, and Qwen2-7B.
Identifies timing differences in smoothing phases using a time-shifted layer sweep.
Assesses relational similarity of failure trajectories across model architectures.
Failure trajectories between 22% and 72% depth show higher relational similarity across architectures than correct trajectories (Δ GW < 0).
Mistral-7B shows structural alignment with Qwen2 and Llama-3 during intermediate processing despite apparent divergence at the output layer.
Introduces the 'Semantic Runtime Kernel', a proposed model-agnostic safety layer for intercepting failure modes at runtime.
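
The depth window and the time-shifted layer sweep in the points above can be pictured with a small sketch. It is only one plausible reading, not the paper's procedure: the layer counts are assumptions taken from the models' public configurations, and `depth_window` and `matched_layer` are hypothetical helper names, as is the convention that relative depth is the layer index divided by the last layer index.

```python
# Assumed decoder-layer counts (not stated in the summary itself).
NUM_LAYERS = {"Llama-3-8B": 32, "Mistral-7B": 32, "Qwen2-7B": 28}


def depth_window(num_layers, lo=0.22, hi=0.72):
    """Return the 0-indexed layers whose relative depth lies in [lo, hi].

    Relative depth of layer i is taken as i / (num_layers - 1), so the
    last layer sits at depth 1.0 (a convention chosen for this sketch).
    """
    last = num_layers - 1
    return [i for i in range(num_layers) if lo <= i / last <= hi]


def matched_layer(layer, src_layers, dst_layers, shift=0.0):
    """Map a layer of one model to the layer at the same relative depth
    in another model, optionally offset by `shift` (a fraction of depth).

    Sweeping `shift` over a range is one way to compare models whose
    smoothing phase may occur earlier or later in the stack.
    """
    depth = layer / (src_layers - 1) + shift
    depth = min(max(depth, 0.0), 1.0)  # clamp to the valid depth range
    return round(depth * (dst_layers - 1))


if __name__ == "__main__":
    for name, n in NUM_LAYERS.items():
        window = depth_window(n)
        print(name, "window layers:", window[0], "to", window[-1])
    # Compare a mid-stack Mistral-7B layer to Qwen2-7B at several shifts.
    for shift in (-0.1, 0.0, 0.1):
        print(shift, matched_layer(15, 32, 28, shift=shift))
```

Working in relative depth rather than raw layer index keeps the comparison meaningful between models with different numbers of layers.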

Abstract

This paper presents empirical evidence of a shared geometric instability pattern in hallucination events that transcends specific large language model (LLM) architectures. By employing Gromov-Wasserstein (GW) distance to align the hidden-state manifolds of Llama-3-8B, Mistral-7B, and Qwen2-7B, we identify a Phase-Shifted Structural Alignment.

Key Findings:

Smoothing Dynamics: We define "smoothing" as the latent process by which transformer architectures reshape internal semantic contradictions into plausible-looking outputs. Our analysis reveals that models differ primarily in the timing of their smoothing phase (layer depth) rather than in the structure of the underlying collapse.

Universal Alignment Window: Failure trajectories between 22% and 72% depth exhibit reliably higher relational similarity across disparate architectures than correct trajectories (Δ GW < 0).

Phase-Shift Discovery: While Mistral-7B initially appears to diverge at the output layer (L31), a time-shifted layer sweep confirms strong structural alignment with Qwen2 and Llama-3 during intermediate processing phases.

Semantic Runtime Kernel: These results provide the mathematical foundation for a model-agnostic safety layer, a "Semantic Runtime Kernel", capable of intercepting agent failure modes in real time, independent of the underlying model's coordinate system.

This work marks the completion of AIIE Phase 3, transitioning AI safety from post-hoc moderation to proactive runtime reliability engineering.
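
To illustrate why a GW-style comparison does not depend on a model's coordinate system, here is a minimal toy sketch. It is not the paper's pipeline: it brute-forces one-to-one matchings between tiny synthetic point clouds, which yields only an upper bound on the squared-loss GW objective, and the function names and the sign convention for Δ GW (failure discrepancy minus correct discrepancy) are assumptions made for illustration.

```python
import itertools
import math


def pairwise_distances(points):
    """Euclidean distance matrix of a small point cloud."""
    return [[math.dist(p, q) for q in points] for p in points]


def gw_permutation_bound(xs, ys):
    """Brute-force GW-style discrepancy between two equal-size point clouds.

    Only one-to-one matchings (permutations) with uniform weights are
    searched, so the result is an upper bound on the squared-loss GW
    objective, not its exact value. Feasible only for a handful of points.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch requires equal-size point clouds")
    cx, cy = pairwise_distances(xs), pairwise_distances(ys)
    n = len(xs)
    best = math.inf
    for perm in itertools.permutations(range(n)):
        # Compare intra-cloud distances only: coordinates never mix.
        cost = sum(
            (cx[i][k] - cy[perm[i]][perm[k]]) ** 2
            for i in range(n)
            for k in range(n)
        ) / n ** 2
        best = min(best, cost)
    return best


def delta_gw(fail_a, fail_b, ok_a, ok_b):
    """Assumed convention: a negative value means the failure clouds are
    relationally closer to each other than the correct clouds are."""
    return gw_permutation_bound(fail_a, fail_b) - gw_permutation_bound(ok_a, ok_b)


if __name__ == "__main__":
    # Synthetic toy points, not data from the study.
    square = [(0, 0), (1, 0), (1, 1), (0, 1)]
    rotated = [(0, 0), (0, 1), (-1, 1), (-1, 0)]   # square rotated by 90 degrees
    rectangle = [(0, 0), (3, 0), (3, 1), (0, 1)]   # different internal geometry
    print(gw_permutation_bound(square, rotated))
    print(gw_permutation_bound(square, rectangle))
    print(delta_gw(square, rotated, square, rectangle))
```

Because only within-cloud distances are compared, rotating or translating either cloud leaves the discrepancy unchanged. For real hidden states with many points, a dedicated optimal transport solver would be needed in place of the permutation search.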
