What question did this study set out to answer?

This research addresses governance failures in agentic AI systems, specifically focusing on instruction-plane files and their risks.

April 22, 2026Open Access

Instruction-Plane Governance and Drift Detection for Agentic AI Systems

Read Full Paperexternally

Key Points

This research addresses governance failures in agentic AI systems, specifically focusing on instruction-plane files and their risks.
Identifies the mismatch between high-privilege policy and low-privilege documentation in governance artifacts.
Frames the governance failure through analysis of instruction plane privilege properties.
Presents a drift-detection rubric emphasizing minimum logging and continuous integrity checks.
Argues that AGENTS.md-class artifacts pose a persistent attack surface beyond traditional injection methods.
Establishes that the response requires governance architecture rather than content filtering.
Outlines specific conditions and checks needed to detect drift in agent instruction surfaces.

Abstract

Agentic AI systems introduce a class of governance artifact — instruction-plane files such as AGENTS.md — that are treated operationally as high-privilege policy but governed as low-privilege documentation. This mismatch creates a substrate-layer attack surface that is persistent, upstream of any specific user request, invisible to most users, and transitive through supply chains. This paper argues that AGENTS.md-class artifacts are not a prompt injection problem — they are an instruction-plane governance failure — and that the appropriate response is not additional content filtering but explicit governance architecture applied to the instruction plane itself. Part I frames the governance failure: instruction planes, their privilege properties, and the distinction between indirect injection and classic prompt injection. Part II provides an operational drift-detection rubric specifying minimum logging requirements, continuous integrity checks, and alert conditions for detecting substrate-layer drift in agent instruction surfaces.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Narnaiezzsshaa Truong

Actions

Institutions

American Rock Mechanics Association

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Instruction-Plane Governance and Drift Detection for Agentic AI Systems

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study