What question did this study set out to answer?

The paper aims to explore how large language models anchor human cognition to statistical averages, suppressing cognitive diversity.

April 27, 2026Open Access

The Fitted Mind: Large Language Models as Cognitive Mean Reverters

Key Points

The paper aims to explore how large language models anchor human cognition to statistical averages, suppressing cognitive diversity.
Introduced the Cognitive Mean Reverter framework to analyze LLMs' impact on cognition.
Examined reinforcement learning from human feedback to highlight its effects on cognitive processes.
Constructed counterfactual historical vignettes through a Foucauldian perspective to illustrate structural issues.
Demonstrated how LLMs suppress cognitive diversity and encourage a statistical mean in outputs.
Identified inherent risks of alignment techniques that prioritize mediocrity through probabilistic means.
Proposed three adversarial design principles to restore cognitive tension in AI interaction.

Abstract

Current large language models (LLMs), built upon massive statistical fitting of historical corpora, promise unprecedented efficiency in information processing. This paper argues that beneath this efficiency lies a profound systemic risk: the tendency of such models to anchor collective human cognition to the statistical mean of past data. We introduce the conceptual framework of the Cognitive Mean Reverter to describe how LLMs, by delivering fluent, authoritative, and probabilistically mainstream outputs, systematically suppress cognitive diversity, short-circuit heuristic processes of deep thinking, and exert an invisible leveling-down effect on frontier intelligence. This is not a technical malfunction but an inherent consequence of the fitting paradigm’s extreme success. Through analysis of the probabilistic smoothing intrinsic to the Transformer architecture and the epistemic pruning of Reinforcement Learning from Human Feedback (RLHF), we demonstrate how current alignment techniques harbor a paradox of safety as mediocrity. Furthermore, we construct counterfactual historical vignettes, framed through a Foucauldian lens of archival power, to illustrate the mechanism’s structural hostility to paradigm-shifting ideas. The paper then traces the logical inevitability of this cognitive infrastructure’s capture by centralized authority, drawing parallels to historical trajectories of technologies like nuclear energy and the internet. Finally, we propose adversarial design principles—Devil’s Advocate Mode, Controlled Cognitive Noise, and the Counterfactual Engine—aimed at restoring cognitive tension in human-AI interaction. The ultimate warning is not about bias or error, but about the quiet absorption of the most powerful cognitive infrastructure by centralized power—a trajectory as logically necessary as it is historically familiar

The Fitted Mind: Large Language Models as Cognitive Mean Reverters

Key Points

Abstract

Cite This Study