What question did this study set out to answer?

This research aims to understand the geometric properties of transformer models using the spectral slope as a diagnostic tool.

April 17, 2026Open Access

The Representational Budget: Scale, RL, and Multimodal Alignment Compete for Geometric Potential in Transformers

Key Points

This research aims to understand the geometric properties of transformer models using the spectral slope as a diagnostic tool.
Introduced spectral slope S(ℓ) to analyze PCA eigenvalues of hidden states
Conducted experiments across 13 models from 5 architecture families
Examined variations in reinforcement learning intensity and modality count
Monitored changes in spectral properties during model training
Spectral expansion decreases with model size in the Qwen3 family
Participation ratio significantly correlates with reinforcement learning intensity
Chain-of-thought reasoning mitigates compression effects from RL
Mixture of Experts (MoE) routing enhances spectral diversity
PR collapse observed consistently across model sizes and RL techniques

Abstract

We introduce the spectral slope S(ℓ)—the log-linear decay rate of PCA eigenvalues computed from hidden-state representations at layer ℓ—as a cheap, per-layer diagnostic scalar for Transformer geometry. Across four rounds of experiments on 13 models from 5 architecture families (0.6B–30B parameters, dense and MoE, with varying RL intensity and modality count), we find that (1) per-layer spectral expansion ΔS/L decays monotonically with log N within the Qwen3 family (r=−0.968, p=0.007); (2) output-layer participation ratio PR tracks RL training intensity from 13.3 (base) to 4.3 (extreme RL); (3) chain-of-thought reasoning reverses RL-induced compression at runtime; (4) MoE routing increases aggregate spectral diversity; (5) post-hoc multimodal alignment consumes mid-layer spectral budget; and (6) real-time monitoring during DPO and GRPO training reveals that PR collapse is universal across 3 model sizes and 2 RL methods, with a characteristic danger window at step ~100 invisible to loss and reward curves. Code and toolkit: https://github.com/HenryZ838978/spectral-flow-probe

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

jing zhang (Wed,) studied this question.

synapsesocial.com/papers/69e1cefb5cdc762e9d857e5a https://doi.org/https://doi.org/10.5281/zenodo.19585082

Bookmark

View Full Paper