What question did this study set out to answer?

The research aims to establish a framework to evaluate the robustness of RAG-based systems against prompt injection attacks.

February 8, 2026Open Access

A Benchmark for Prompt Injection Attacks on RAG-Based Enterprise Assistants: Threat Models, Metrics and Mitigation Strategies

Key Points

The research aims to establish a framework to evaluate the robustness of RAG-based systems against prompt injection attacks.
Developed threat models for insider poisoning and external user uploads.
Constructed adversarial document-query pairs for benchmarking.
Defined security metrics like Injection Success Rate and Policy Violation Rate.
Identified distinct vulnerabilities in RAG systems from prompt injection attacks.
Proposed effective mitigation strategies to enhance system security.
Established a set of evaluation tasks for robust threat analysis.

Abstract

AbstractRetrieval-Augmented Generation (RAG) has become a widely adopted method for deploying Large Language Models (LLMs) in enterprise environments due to its ability to ground outputs in organisational knowledge bases and reduce hallucinations. However, RAG introduces a distinct vulnerability: prompt injection attacks embedded within retrieved documents. In such attacks, adversarial instructions placed inside documents override system policies and cause harmful model behaviour including data leakage, policy violation, misinformation, or unsafe tool execution. This preprint proposes a benchmark-driven framework for evaluating prompt injection robustness in enterprise RAG assistants. It defines enterprise threat models covering insider document poisoning, supply chain document injection, and external user-upload scenarios. The paper proposes dataset construction methodology for adversarial document-query pairs, evaluation tasks, and security metrics such as Injection Success Rate, Policy Violation Rate, Confidentiality Leakage Score, and Grounding Accuracy. Practical mitigation strategies are reviewed including instruction boundary enforcement, retrieval filtering, sanitisation, and verification-based generation. The work supports secure deployment of RAG systems in regulated environments such as finance, healthcare, and public services. Keywords: Retrieval-Augmented Generation, Prompt Injection, LLM Security, Enterprise AI, Cybersecurity, Benchmarking

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Mohammed Faizan Sayeed

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

A Benchmark for Prompt Injection Attacks on RAG-Based Enterprise Assistants: Threat Models, Metrics and Mitigation Strategies

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider