July 10, 2025

A Systematic Survey of Multi-Agent Reinforcement Learning

Key Points

The hybrid method achieved over 85% win rate in StarCraft II, but communication efficiency remains an issue.
Common methods like MADDPG and QMIX are foundational in multi-agent reinforcement learning strategies.
Evaluation criteria include win rate and convergence speed, which are crucial for assessing performance.
Research proposes integrating graph neural networks with meta-learning to tackle challenges in existing methods.

Abstract

Multi-Agent Reinforcement Learning (MARL) solves collaboration and competition problems in complex dynamic environments through distributed decision-making mechanisms, and has made significant progress in recent years in areas such as autonomous driving and robot cluster control. In this paper, we systematically sort out the theoretical framework, mainstream methods (e.g., MADDPG, QMIX), commonly used datasets (SMAC, Pommerman), and evaluation criteria (win rate, convergence speed) of MARL, and analyze the core challenges of the existing methods, such as non-smoothness, and credit allocation. Experiments show that the winning rate of the hybrid method on StarCraft II has reached more than 85%, but the communication efficiency and scalability still need to be improved. This paper proposes the improvement direction of combining graph neural networks and meta-learning for subsequent research.

Demander à l'IA

Bookmark

Cite This Study

Jinsong Leng (Thu,) studied this question.

synapsesocial.com/papers/68af55ccad7bf08b1eadc10a https://doi.org/https://doi.org/10.62051/dvnhyg89

Demander à l'IA

Bookmark