October 13, 2025

Open Access

Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning

Key Points

On CIFAR-10, the RL agent raises the success rate of adversarial examples by 19.4% from the start to the end of training.
Over the same period, it reduces the median number of victim model queries per adversarial example by 53.2%.
In a head-to-head comparison with SquareAttack, a state-of-the-art image attack, the method generates adversarial examples with 13.1% more success after 5000 episodes of training.
From a security perspective, the study demonstrates a new attack vector that uses RL to attack ML models efficiently and at scale.

Abstract

Reinforcement learning (RL) offers powerful techniques for solving complex sequential decision-making tasks from experience. In this paper, we demonstrate how RL can be applied to adversarial machine learning (AML) to develop a new class of attacks that learn to generate adversarial examples: inputs designed to fool machine learning models. Unlike traditional AML methods that craft adversarial examples independently, our RL-based approach retains and exploits past attack experience to improve future attacks. We formulate adversarial example generation as a Markov Decision Process and evaluate RL's ability to (a) learn effective and efficient attack strategies and (b) compete with state-of-the-art AML. On CIFAR-10, our agent increases the success rate of adversarial examples by 19.4% and decreases the median number of victim model queries per adversarial example by 53.2% from the start to the end of training. In a head-to-head comparison with a state-of-the-art image attack, SquareAttack, our approach enables an adversary to generate adversarial examples with 13.1% more success after 5000 episodes of training. From a security perspective, this work demonstrates a powerful new attack vector that uses RL to attack ML models efficiently and at scale.
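To make the Markov Decision Process framing concrete, the following is a minimal sketch of how black-box adversarial example generation could be cast as an episodic environment. It is not the authors' formulation: the state (the current perturbed image), the action (setting one pixel to its original value plus or minus `eps`), the reward (the drop in the victim's confidence on the true label), the L-infinity budget, and every name here (`EvasionEnv`, `make_linear_victim`, `random_policy`, `run_episode`) are assumptions chosen for illustration. The victim is a toy linear softmax classifier, not a CIFAR-10 model.

```python
import numpy as np


class EvasionEnv:
    """Toy MDP for black-box evasion (hypothetical; not the paper's design)."""

    def __init__(self, victim, x, label, eps=0.1, max_queries=100):
        self.victim = victim            # callable: image -> class probabilities
        self.x0 = x.astype(np.float64)  # clean input with values in [0, 1]
        self.label = label              # true label the attacker wants to flip
        self.eps = eps                  # assumed L-infinity perturbation budget
        self.max_queries = max_queries  # assumed per-example query budget

    def reset(self):
        self.x = self.x0.copy()
        self.queries = 1                # one query to read the initial confidence
        self.conf = self.victim(self.x)[self.label]
        return self.x.copy()

    def step(self, action):
        index, sign = action            # flat pixel index and direction (+1 or -1)
        x = self.x.copy()
        x.flat[index] = self.x0.flat[index] + sign * self.eps
        x = np.clip(x, 0.0, 1.0)        # keep the image in the valid range
        probs = self.victim(x)          # one black-box query per step
        self.queries += 1
        reward = self.conf - probs[self.label]  # confidence drop on true label
        self.x, self.conf = x, probs[self.label]
        success = int(np.argmax(probs)) != self.label
        done = success or self.queries >= self.max_queries
        return self.x.copy(), reward, done, {"success": success, "queries": self.queries}


def make_linear_victim(weights):
    """Build a toy softmax classifier standing in for the victim model."""
    def victim(x):
        logits = weights @ x.ravel()
        z = np.exp(logits - logits.max())
        return z / z.sum()
    return victim


def random_policy(state, rng):
    """Placeholder for a learned policy: pick a random pixel and direction."""
    return int(rng.integers(state.size)), int(rng.choice([-1, 1]))


def run_episode(env, policy, rng):
    """Roll out one episode and return the total reward and the final info dict."""
    state = env.reset()
    total, done, info = 0.0, False, {}
    while not done:
        state, reward, done, info = env.step(policy(state, rng))
        total += reward
    return total, info
```

In this sketch an RL agent would replace `random_policy` and be trained across many episodes on different inputs, which is one way to read the abstract's point about retaining past attack experience. Success rate and median queries per adversarial example could then be computed from the `info` dictionaries returned by `run_episode`.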

Authors

Kyle Domico

Jean-Charles Noirot Ferrand

Ryan Sheatsley
