Agentic AI systems introduce a class of governance artifact — instruction-plane files such as AGENTS.md — that are treated operationally as high-privilege policy but governed as low-privilege documentation. This mismatch creates a substrate-layer attack surface that is persistent, upstream of any specific user request, invisible to most users, and transitive through supply chains. This paper argues that AGENTS.md-class artifacts are not a prompt injection problem — they are an instruction-plane governance failure — and that the appropriate response is not additional content filtering but explicit governance architecture applied to the instruction plane itself. Part I frames the governance failure: instruction planes, their privilege properties, and the distinction between indirect injection and classic prompt injection. Part II provides an operational drift-detection rubric specifying minimum logging requirements, continuous integrity checks, and alert conditions for detecting substrate-layer drift in agent instruction surfaces.
Building similarity graph...
Analyzing shared references across papers
Loading...
Narnaiezzsshaa Truong
American Rock Mechanics Association
Building similarity graph...
Analyzing shared references across papers
Loading...
Narnaiezzsshaa Truong (Mon,) studied this question.
synapsesocial.com/papers/69e865fd6e0dea528ddea601 — DOI: https://doi.org/10.5281/zenodo.19672118