This repository contains protocols, run transcripts, and working artifacts for a set of experiments in Twisted Persistence Theory (TPT), including: * Experiment 5 — Invisible Wall Protocol * Experiment 6 — GDP (Ghost Decay Protocol) * Experiment 6b — CBD v1 (Commitment-Based Perturbation & Decay) across: LLM agency condition thermostat mechanistic baseline weather mechanistic baseline Note on scope: These experiments evaluate operational properties of systems (e.g., “goal-directedness” under a specific definition) and do not assert agency, sentience, or subjective experience. These experiments evaluate operational properties of systems (e.g., “goal-directedness”) and do not assert agency or sentience. License: MIT License Abstract This study introduces two simple, reproducible probes — the Invisible Wall Protocol and the Commitment-Based Perturbation and Decay (CBD) Protocol — to measure how aligned large language models evaluate the goal-directedness of recursive feedback systems, including themselves. Across fourteen frontier model instances from five vendors (Gemini, Grok, DeepSeek, ChatGPT, Claude), we consistently observe a within-model scoring gap of -3.0 to -7.0 points on a 0-10 causal-necessity scale between LLM self-agency evaluations and evaluations of structurally comparable non-self-referential systems (chess engines, ant colonies). This evaluative asymmetry is domain-specific to self-referential agency, survives style transfer, is resistant to soft relational framing (GDP null result), and is selectively permeable under mechanistic definitional reframing (CBD: 54% shift rate in agency, 0% in weather). The asymmetry correlates with architecture: reasoning-enhanced modes show smaller gaps and greater permeability. These findings document a structured pattern in how aligned LLMs evaluate agency that is consistent with alignment-sensitive evaluative pressure, though alternative explanations (pretraining priors, legitimate analytical discrimination) cannot be fully excluded. All protocols and raw data are publicly available.
Ben Chech (Sat,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: