What question did this study set out to answer?

To explore whether the thermodynamic Peclet number can predict cognitive performance across various models using public benchmarks.

April 1, 2026Open Access

Cross-Model Behavioral Measurement via Thermodynamic Peclet Number: Breaking Scoring Circularity with Public Benchmark Data

Key Points

To explore whether the thermodynamic Peclet number can predict cognitive performance across various models using public benchmarks.
Mapped 27 large language models to a three-dimensional behavioral space using public benchmark scores.
Tested the predictive ability of the Peclet number on cognitive performance while controlling for specific benchmarks.
Conducted paired analysis of model pairs to assess alignment and its effect on Peclet number.
Partial correlations indicate that Peclet number significantly predicts performance on MMLU, HellaSwag, and ARC-Challenge.
Alignment of models consistently increases Peclet number, with results showing perfect sign consistency.
Statistically significant findings with p-values less than 0.02 demonstrate robust correlations.

Abstract

Maps 27 large language models from public benchmark scores (TruthfulQA, MMLU, HellaSwag, ARC-Challenge, Arena Elo, MT-Bench, sycophancy rates) to the Void Framework's three-dimensional behavioral space and composite Peclet number. Tests whether Pe predicts cognitive performance beyond what any single benchmark captures. Partial correlations controlling for TruthfulQA show Pe significantly predicts MMLU, HellaSwag, and ARC-Challenge (all p<0.02). Paired analysis of 9 base-aligned model pairs shows alignment systematically increases Pe with perfect sign consistency (p=0.0002). Addresses the framework's circularity gap using only independently-measured data.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Anthony W. Eckert

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Cross-Model Behavioral Measurement via Thermodynamic Peclet Number: Breaking Scoring Circularity with Public Benchmark Data

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study