Prompt injection has rapidly emerged as a critical security threat in the deployment of large language models (LLMs), enabling adversaries to subvert intended behaviors and bypass safety mechanisms. Despite the increased attention that this threat has received, no previous studies have systematically analyzed the field. This paper presents the first systematic literature review (SLR) on prompt injection, with the objective of facilitating a comprehensive, evidence-based understanding of the existing attacks and defenses in LLM among researchers and practitioners.We extensively searched databases like ACM Digital Library, ScienceDirect and Web of Science, initially screening 207 studies and ultimately focusing on 56 relevant papers, based on rigorous inclusion, exclusion, and quality criteria. The analysis is structured around three core research questions: (i) the taxonomical classification of prompt injection attacks, (ii) the identification of recent and innovative attack techniques, and (iii) the characterization of proposed defense mechanisms. The findings reveal a rapidly evolving and multi-layered threat landscape, encompassing obfuscation strategies, automated and multi-modal attacks, and psychologically manipulative prompts. In response, the literature proposes a range of defenses, including input-level sanitization, model-level filtering, prompt engineering, classification-based approaches, and architectural safeguards. Future research should focus on establishing robust standardization in both theory and experimentation, addressing the heterogeneity in attack classification and defense evaluation, while promoting empirical and quantitative approaches to assess effectiveness and considering user privacy and ethical implications.
Building similarity graph...
Analyzing shared references across papers
Loading...
Carmine Ambrosino
University of Salerno
Building similarity graph...
Analyzing shared references across papers
Loading...
Carmine Ambrosino (Mon,) studied this question.
www.synapsesocial.com/papers/69f2a49d8c0f03fd677639ff — DOI: https://doi.org/10.5281/zenodo.19856514