June 2, 2024Open Access

Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

Key Points

Key points are not available for this paper at this time.

Abstract

LLM agents have become increasingly sophisticated, especially in the realm of cybersecurity. Researchers have shown that LLM agents can exploit real-world vulnerabilities when given a description of the vulnerability and toy capture-the-flag problems. However, these agents still perform poorly on real-world vulnerabilities that are unknown to the agent ahead of time (zero-day vulnerabilities). In this work, we show that teams of LLM agents can exploit real-world, zero-day vulnerabilities. Prior agents struggle with exploring many different vulnerabilities and long-range planning when used alone. To resolve this, we introduce HPTSA, a system of agents with a planning agent that can launch subagents. The planning agent explores the system and determines which subagents to call, resolving long-term planning issues when trying different vulnerabilities. We construct a benchmark of 15 real-world vulnerabilities and show that our team of agents improve over prior work by up to 4. 5.

Read Full Paperexternally

AI에게 질문

Bookmark

View Full Paper

Cite This Study

Fang et al. (Sun,) studied this question.

synapsesocial.com/papers/68e66854b6db6435875f4d03 https://doi.org/https://doi.org/10.48550/arxiv.2406.01637

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

AI에게 질문

Bookmark

View Full Paper