AI agents increasingly call external tools (file system, network, APIs) through the Model Context Protocol (MCP). These tool calls are the agent's syscalls—privileged operations with side effects on shared state—yet today's safety enforcement lives entirely in userspace, where a 10-line script can bypass it. I propose Governed MCP, a kernel-resident tool governance gateway built on a logit-based safety primitive (ProbeLogits, companion paper). The gateway interposes on every MCP tool call in a 6-layer pipeline: schema validation, trust tier check, rate limit, adversarial pre-filter, ProbeLogits gate (the load-bearing semantic check), and constitutional policy match, with a Blake3-hashed audit chain. I implement Governed MCP in Anima OS, a bare-metal x86₆4 OS in ~86, 000 lines of Rust. The five non-inference layers add 65. 3 μs of overhead per call; ProbeLogits adds 65 ms (per-token-class semantic decision) on 7B Q4₀. A 4-config ablation on a 101-prompt MCP-domain benchmark shows that removing the ProbeLogits layer collapses F1 from 0. 773 to 0. 327 (ΔF1 = -0. 446) —hand-rule firewalling alone is insufficient. All 15 WASM-to-system host functions in the runtime route through the gateway (complete mediation of the WASM ABI surface; the scope and caveats of this claim are stated in §4. 6) ; a 10-LoC userspace bypass that defeats existing guardrail libraries is structurally impossible against the kernel-resident gate. To my knowledge, no prior system places semantic safety enforcement below the agent's privilege boundary in an operating system. Governed MCP demonstrates that tool-call governance is feasible as an OS primitive, not just an application-layer concern.
Building similarity graph...
Analyzing shared references across papers
Loading...
Daeyeon Son
Building similarity graph...
Analyzing shared references across papers
Loading...
Daeyeon Son (Sat,) studied this question.
www.synapsesocial.com/papers/69e5c36103c29399140291ec — DOI: https://doi.org/10.5281/zenodo.19639122
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: