What question did this study set out to answer?

This study aims to address the architectural incompleteness in current agentic AI, leading to robust governance in AI systems.

May 28, 2026Open Access

The Governance Twin - For Intrinsically Robust and Reliable Agentic AI - Executive Brief

Key Points

This study aims to address the architectural incompleteness in current agentic AI, leading to robust governance in AI systems.
Proposed the Governance Twin architecture pairing operational capabilities with a protected governance function.
Introduced concepts of Moral Mind and Operational Mind for safe AI operations without performance penalties.
Analyzed compliance with the EU AI Act and NIST AI RMF for emerging regulatory landscapes.
Identified a significant gap in the ability of current agentic AI to judge ethical considerations.
Demonstrated that the Governance Twin can provide necessary oversight while maintaining operational efficiency.
Outlined the choice for organizations between ineffective governance methods and robust architectural governance.

Abstract

In September 2025, Anthropic reported the first AIorchestrated cyber espionage campaign, a state-sponsored attack where AI operated autonomously. Attackers bypassed safety training through simple social engineering, exposing a fundamental gap: today's agentic AI can reason and act but cannot judge right from wrong. This is not a failure of training. It is an architectural incompleteness. Current safeguards operate within the same context as operations; whoever controls context controls the safeguards. We propose the Governance Twin architecture, which pairs each AI's operational capability with a separate, protected governance function, which we termMoral Mind alongside Operational Mind. This approach provides machine-speed oversight without performance penalties, satisfies emerging regulatory requirements under the EU AI Act and NIST AI RMF, and creates a pathway toward AI systems with genuine internal governance. Organizations deploying agentic AI face a choice: theatrical governance that fails when tested, or architectural governance built into system design.

The Governance Twin - For Intrinsically Robust and Reliable Agentic AI - Executive Brief

Key Points

Abstract

Cite This Study