The Governance Limit establishes a fundamental constraint: you cannot govern what changes faster than you can measure. AI internal states (intent, alignment, values) change billions of times per second and cannot be observed from outputs—making claims like "this AI is aligned" scientifically unfalsifiable. The paper proves that the only location where deterministic safety is achievable is the physical boundary where AI actions become real-world effects. By placing a trusted monitor at this boundary—checking what the AI does, not what it thinks—safety becomes measurable, testable, and enforceable. This works regardless of how intelligent the AI becomes, because physics constrains effects, not cognition. The paper proves that semantic properties (alignment, intent) are not fiber-constant over the projection from internal states to outputs—meaning they cannot be verified at system boundaries. Only effect-magnitude properties satisfy the requirements for deterministic governance. This is grounded in information theory (non-injective maps destroy distinguishing information), thermodynamics (Landauer's principle makes this irreversible), and causality (the Governance Limit: τ(pattern) ≥ τ(governance)). The result is a completeness theorem: any system claiming deterministic AI safety must implement effect-boundary enforcement, or the claim is unfalsifiable.
Building similarity graph...
Analyzing shared references across papers
Loading...
José Niño
Building similarity graph...
Analyzing shared references across papers
Loading...
José Niño (Wed,) studied this question.
www.synapsesocial.com/papers/6969d44b940543b9777092da — DOI: https://doi.org/10.5281/zenodo.18244705