What question did this study set out to answer?

The aim is to establish limitations in governing advanced AI systems based on their operational speed and complexity.

January 16, 2026Open Access

The Governance Limit : Physical Foundations of Deterministic AI Safety & A Completeness Theorem for AI Governance Architectures

Key Points

The aim is to establish limitations in governing advanced AI systems based on their operational speed and complexity.
Introduced the concept of the Governance Limit as a constraint on AI governance.
Analyzed AI internal states and their rapid changes in relation to measurable outputs.
Developed a completeness theorem for effective governance through monitoring real-world AI actions.
Proved that deterministic safety can only be achieved at the real-world action boundary of AI.
Showed that semantic properties like alignment cannot be verified effectively at system boundaries.
Demonstrated that effect-magnitude properties fulfill requirements for deterministic governance.

Abstract

The Governance Limit establishes a fundamental constraint: you cannot govern what changes faster than you can measure. AI internal states (intent, alignment, values) change billions of times per second and cannot be observed from outputs—making claims like "this AI is aligned" scientifically unfalsifiable. The paper proves that the only location where deterministic safety is achievable is the physical boundary where AI actions become real-world effects. By placing a trusted monitor at this boundary—checking what the AI does, not what it thinks—safety becomes measurable, testable, and enforceable. This works regardless of how intelligent the AI becomes, because physics constrains effects, not cognition. The paper proves that semantic properties (alignment, intent) are not fiber-constant over the projection from internal states to outputs—meaning they cannot be verified at system boundaries. Only effect-magnitude properties satisfy the requirements for deterministic governance. This is grounded in information theory (non-injective maps destroy distinguishing information), thermodynamics (Landauer's principle makes this irreversible), and causality (the Governance Limit: τ(pattern) ≥ τ(governance)). The result is a completeness theorem: any system claiming deterministic AI safety must implement effect-boundary enforcement, or the claim is unfalsifiable.

Read Full Paperexternally

KI fragen

Bookmark

View Full Paper