What type of study is this?

This is a Quantitative Study study.

September 26, 2025Open Access

Learning to Balance Mixed Adversarial Attacks for Robust Reinforcement Learning

Key Points

Mixed adversarial attacks significantly reduce reinforcement learning agent performance, necessitating robust defenses.
The study introduces ASA-MDP, modeling the interaction as a zero-sum game between agents and adversaries targeting both states and actions.
ASA-PPO is proposed to optimize the adversary's strategy, enhancing the agent's robustness against diverse adversarial scenarios.
Experiments show that ASA-PPO exhibits superior performance compared to standard PPO and single-type methods in mixed-attack situations.

Abstract

Reinforcement learning agents are highly susceptible to adversarial attacks that can severely compromise their performance. Although adversarial training is a common countermeasure, most existing research focuses on defending against single-type attacks targeting either observations or actions. This narrow focus overlooks the complexity of real-world mixed attacks, where an agent’s perceptions and resulting actions are perturbed simultaneously. To systematically study these threats, we introduce the Action and State-Adversarial Markov Decision Process (ASA-MDP), which models the interaction as a zero-sum game between the agent and an adversary attacking both states and actions. Using this framework, we show that agents trained conventionally or against single-type attacks remain highly vulnerable to mixed perturbations. Moreover, we identify a key challenge in this setting: a naive mixed-type adversary often fails to effectively balance its perturbations across modalities during training, limiting the agent’s robustness. To address this, we propose the Action and State-Adversarial Proximal Policy Optimization (ASA-PPO) algorithm, which enables the adversary to learn a balanced strategy, distributing its attack budget across both state and action spaces. This, in turn, enhances the robustness of the trained agent against a wide range of adversarial scenarios. Comprehensive experiments across diverse environments demonstrate that policies trained with ASA-PPO substantially outperform baselines—including standard PPO and single-type adversarial methods—under action-only, observation-only, and, most notably, mixed-attack conditions.

Read Full Paperexternally

Ask AI

Helpful

Bookmark

View Full Paper