The Governance Limit establishes a fundamental constraint: you cannot govern what changes faster than you can measure. AI internal states (intent, alignment, values) change billions of times per second and cannot be observed from outputs—making claims like "this AI is aligned" scientifically unfalsifiable. The paper proves that the only location where deterministic safety is achievable is the physical boundary where AI actions become real-world effects. By placing a trusted monitor at this boundary—checking what the AI does, not what it thinks—safety becomes measurable, testable, and enforceable. This works regardless of how intelligent the AI becomes, because physics constrains effects, not cognition. The paper proves that semantic properties (alignment, intent) are not fiber-constant over the projection from internal states to outputs—meaning they cannot be verified at system boundaries. Only effect-magnitude properties satisfy the requirements for deterministic governance. This is grounded in information theory (non-injective maps destroy distinguishing information), thermodynamics (Landauer's principle makes this irreversible), and causality (the Governance Limit: τ(pattern) ≥ τ(governance)). The result is a completeness theorem: any system claiming deterministic AI safety must implement effect-boundary enforcement, or the claim is unfalsifiable.
José Niño (Wed,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: