What question did this study set out to answer?

The central aim is to explore constitutive governance as a non-distillable substrate for ensuring the safety of AI systems.

February 26, 2026
Open Access

Constitutive Governance: A Non-Distillable Substrate for Agentic Systems

Key Points

Introduced a formal distinction between expressive and constitutive governance.
Developed the Sovereign Agent Stack framework.
Analyzed the Anthropic incident to illustrate substrate-layer conflict.
Demonstrated the safety limitations of expressive governance, which is visible, portable, and therefore distillable.
Established that constitutive governance is non-portable and context-activated.
Proposed constitutive governance as the only viable foundation for safe, non-extractable agentic systems.

Abstract

Recent large-scale extraction attacks against frontier AI systems demonstrate a structural asymmetry: capabilities transfer through distillation, but safeguards do not. This paper introduces constitutive governance, a class of governance that shapes model behavior without appearing in the model's outputs, reasoning traces, or representational patterns. Unlike expressive governance—which is visible, portable, and therefore distillable—constitutive governance is context-activated, privilege-anchored, and event-driven, making it inherently non-representational and non-portable. I formalize the distinction between expressive and constitutive governance, present the Sovereign Agent Stack, and analyze the recent Anthropic incident as the first real-world demonstration of substrate-layer conflict. Constitutive governance is proposed as the only viable foundation for safe, non-extractable agentic systems in the substrate era.
