Agentic AI systems introduce a class of governance artifact — instruction-plane files such as AGENTS.md — that are treated operationally as high-privilege policy but governed as low-privilege documentation. This mismatch creates a substrate-layer attack surface that is persistent, upstream of any specific user request, invisible to most users, and transitive through supply chains. This paper argues that AGENTS.md-class artifacts are not a prompt injection problem — they are an instruction-plane governance failure — and that the appropriate response is not additional content filtering but explicit governance architecture applied to the instruction plane itself. Part I frames the governance failure: instruction planes, their privilege properties, and the distinction between indirect injection and classic prompt injection. Part II provides an operational drift-detection rubric specifying minimum logging requirements, continuous integrity checks, and alert conditions for detecting substrate-layer drift in agent instruction surfaces.
Building similarity graph...
Analyzing shared references across papers
Loading...
Narnaiezzsshaa Truong (Mon,) studied this question.
synapsesocial.com/papers/69e865fd6e0dea528ddea601 — DOI: https://doi.org/10.5281/zenodo.19672118
Narnaiezzsshaa Truong
American Rock Mechanics Association
Building similarity graph...
Analyzing shared references across papers
Loading...