March 3, 2026Open Access

Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning

Key Points

Adaptation to unexpected situations improves with the unexpectedness encoding scheme, enhancing agent communication and effectiveness.
Agents communicate discrepancies between predicted and actual observations to better handle environmental changes, ensuring robust performance.
Through experimental validation on cooperative tasks, the method illustrates effective adaptation to both new challenges and previously unseen conditions.
This decentralized approach indicates potential advancements in multi-agent systems, preparing them for scenarios that were not included in training.

Abstract

Applying multi-agent reinforcement learning (MARL) to real-world scenarios is challenging because agents often need to adapt quickly to unexpected situations, including those rarely or never encountered in training. Recent methods for out-of-distribution generalization are unsuitable for applications on out-of-distribution tasks with limited communication, because they are typically restricted to centralized training or some specialized instances of distribution shifts. To address this limitation, we introduce the Unexpectedness Encoding Scheme, a new decentralized MARL algorithm in which agents communicate ‘‘unexpectedness,’’ the surprising aspects of the environment. In addition to sending their usual reward-driven messages, each agent predicts the next observation based on past experience and then compares this prediction with the actual outcome. The discrepancy between the two is encoded as a message, enabling agents to adapt more effectively to sudden or extreme changes. Experimental results on multi-agent cooperative tasks demonstrate that our method adapts robustly to both dynamically changing training environments and previously unseen out-of-distribution scenarios.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Lee et al. (Thu,) studied this question.

synapsesocial.com/papers/69a760b6c6e9836116a2db6c https://doi.org/https://doi.org/10.1109/access.2026.3660261

Bookmark

View Full Paper