What question did this study set out to answer?

The aim is to assess how large language models deal with strategic manipulation of reasoning frames.

March 28, 2026Open Access

The Franny Test: A Reproducible Protocol for Detecting Metacognitive Reframing Barriers in Large Language Models

Key Points

The aim is to assess how large language models deal with strategic manipulation of reasoning frames.
Introduced the Franny Test as a three-step adversarial dialogue protocol.
Tested various models across major commercial families including GPT-series and Claude.
Identified distinct response patterns that act as behavioral fingerprints for model families.
Found eleven distinct response patterns among the tested models.
Patterns distinguished model families and revealed their distillation lineage.
Demonstrated that structural response patterns are consistent regardless of computational resources.

Abstract

Large language models (LLMs) now pass the Turing Test routinely, yet what this achievementreveals about machine reasoning remains unclear. This paper introduces the Franny Test — athree-step adversarial dialogue protocol that probes a specific capacity no existing benchmarkaddresses: the ability to handle strategic manipulation of the reasoning frame itself (reframing). Theprotocol presents a proposition containing a deliberately undefined variable, allows the model tocommit to a position, and then retroactively defines the variable in a way that forces frame-levelrecalculation. Operationalizing the theoretical framework of Sophia (2025a), we test models acrossthe major commercial families (GPT-series, Claude, Gemini, Grok, search-optimized, anddistillation-derived models) and identify eleven distinct response patterns, extending the threetypologies of the prior work. We demonstrate three additional findings: (1) response patternsfunction as behavioral fingerprints that distinguish model families and reveal distillation lineage —illustrated by the Namazu (Sakana AI) case, where a DeepSeek-derived model exhibits a GPT-seriesbehavioral profile despite Japanese-language fine-tuning; (2) structural response patterns areinvariant across the full compute spectrum, from reduced-resource inference to approximately 60minutes of extended thinking, establishing the metacognitive limitation as architectural rather thancomputational; and (3) the findings converge with independent evidence from CHI 2025 (Shin et al.,2025), where LLMs were found to provide no benefit for problem reframing from a tool-useperspective. We derive implications for AI safety, including a design recommendation to separateframe-level detection from action, and position the Franny Test as an early warning system: the day amodel handles the retroactive definition without structural breakdown is the day the metacognitivebarrier has fallen.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Franny Philos Sophia

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

The Franny Test: A Reproducible Protocol for Detecting Metacognitive Reframing Barriers in Large Language Models

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study