What does this research mean for the field?

To prevent the collapse of governance objectives into operational actions in Agentic AI, systems require a deterministic 'Governance Harness' that strictly isolates the operational plane from the language model and enforces verifiable runtime governance. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This research aims to develop a structured governance framework that mitigates alignment risks in agentic AI systems.

May 29, 2026Open Access

The Governance Harness - The Deterministic Pillar for Anchoring Runtime Governance in Agentic AI

Key Points

This research aims to develop a structured governance framework that mitigates alignment risks in agentic AI systems.
Proposes the Governance Harness as a deterministic component between the agent and the language model.
Ensures no direct paths exist from the operational plane to the language model.
Introduces signed per-call attestations for interoperability and verification without disclosing governance details.
The Governance Harness effectively prevents the collapse of governance into operational planes.
Achieves reliable message observation and governance integrity during AI interactions.
Demonstrates that governance is fundamentally structural rather than reliant on cooperation.

Abstract

Agentic AI systems are often focused on productivity and operational efficiency. The risks caused by misalignment between actions in the operational plane and non-operational objectives — policies, ethics, regulatory expectations — are commonly addressed by "just" adding a governance layer that is neither architecturally nor contextually isolated from the operational plane. The failure mode is concrete: prompts injected by the agent or its environment can compromise the language model’sresponse by introducing constructs that lead away from those non-operational objectives. This is the collapse of the governance plane into the operational plane.To prevent this collapse, two functions are mandatory rather than optional: the operational plane must have no direct path to the language model — the Harness must be the only path, even under failure — and every message from and to the operational plane must be forwarded to the governance plane as an observation. The Governance Harness addresses both halves at once: a deterministic equipment component that executes an immutable protocol and has no decision-making power, interposed between an agent and the language model. On every call, it carries the current governance state into the call and emits a signed per-call attestation — the Governance Anchor — that external parties can verify without trusting the operator and without disclosing what governance state contains. Interpretation of the intent and trajectory of observed messages is the Governance System’s role (in this paper, the Governance Twin); the Harness carries and observes but does not interpret. Governance becomes a structural property of the call rather than a property of voluntary cooperation. The architectural claims here are conditional on stated deployment assumptions; engineering embodiments are addressed in the corresponding technical disclosures.

The Governance Harness - The Deterministic Pillar for Anchoring Runtime Governance in Agentic AI

Key Points

Abstract

Cite This Study