Anthropic's circuit tracing (2025), which builds attribution graphs that reveal computational paths through language models, provides a microscope for measuring the Péclet number (Pe) at the feature level. We define Pe on attribution graph edges as the ratio of directed information flow (drift) to undirected spreading (diffusion) through cross-layer transcoder features. Jailbreak circuits should exhibit measurable Pe gradients, with Pe increasing along the computational path. The 12 jailbreak patterns detected by the Void Framework's Twilight monitoring should then correspond to specific high-Pe subgraphs. This connects macroscopic Pe scoring (N=1,344 platforms) to microscopic mechanism, bridging mechanistic interpretability and thermodynamic field theory.
Anthony W. Eckert (Mon,) studied this question.
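
As a rough illustration of the edge-level ratio described in the abstract, the sketch below computes a Pe-like score for each node on a path through an attribution graph. The abstract does not say how drift and diffusion are measured, so the proxies here are assumptions made only for illustration: drift is taken as the largest absolute outgoing attribution weight, and diffusion as the mean absolute weight of the remaining edges. The function names `edge_peclet`, `pe_profile`, and `is_increasing` are hypothetical, as are the example weights.

```python
from typing import List, Sequence


def edge_peclet(weights: Sequence[float], eps: float = 1e-12) -> float:
    """Pe-like ratio for one node's outgoing attribution edges.

    ASSUMPTION (not from the source): drift is the largest absolute
    weight, diffusion is the mean absolute weight of the other edges.
    """
    mags = sorted((abs(w) for w in weights), reverse=True)
    if not mags:
        raise ValueError("need at least one edge weight")
    drift = mags[0]
    rest = mags[1:]
    diffusion = sum(rest) / len(rest) if rest else 0.0
    # eps avoids division by zero when all flow sits on a single edge.
    return drift / (diffusion + eps)


def pe_profile(path: Sequence[Sequence[float]]) -> List[float]:
    """Pe-like score at each node along a computational path."""
    return [edge_peclet(w) for w in path]


def is_increasing(values: Sequence[float]) -> bool:
    """True if the scores rise strictly from one node to the next."""
    return all(b > a for a, b in zip(values, values[1:]))


if __name__ == "__main__":
    # Made-up outgoing weights for three successive nodes on one path.
    example_path = [
        [0.3, 0.3, 0.4],
        [0.6, 0.2, 0.2],
        [0.9, 0.05, 0.05],
    ]
    profile = pe_profile(example_path)
    print(profile, is_increasing(profile))
```

Under these assumed proxies, a path whose attribution mass concentrates onto a single edge at each successive node yields a rising profile, which is the kind of Pe gradient the abstract predicts for jailbreak circuits. Other choices of drift and diffusion would give different values.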