What type of study is this?

September 10, 2025

Localized Multi-Agent Reinforcement Learning for Cooperative Management of Supply Chains

Key Points

Localized multi-agent reinforcement learning improves collaboration and efficiency in supply chain management, particularly in dynamic environments.
The proposed SNAC algorithm employs local observations and reduces reliance on global information, facilitating scalable coordination across agents.
The study formulates the coordination problem as a Markov decision process, uncovering properties that enhance agents' learning with limited communication.
Numerical experiments validate the effectiveness of the SNAC algorithm in managing the complexities of serial supply chains under varying conditions.

Abstract

While reinforcement learning has significant applications in smart manufacturing, effectively coordinating multiple agents with limited communication remains a significant challenge. In this paper, we propose a localized multi-agent reinforcement learning approach specifically designed for serial supply chains. We formulate the supply chain management problem as a Markov decision process and uncover the exponential decay property of the Formula: see text-functions, which allows each agent to approximate the Formula: see text-functions using local observations and communications. Then, we propose the scalable natural actor–critic (SNAC) algorithm to solve the problem. The SNAC algorithm leverages localized coordination and reduces reliance on global information, thus addressing the challenges of large-scale and dynamic supply chain environments. Additionally, we conduct numerical experiments to demonstrate the effectiveness of SNAC in managing serial supply chains.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Rongjinzi Wang

Ruiyang Jin

Jie Song

Journals

Asia Pacific Journal of Operational Research

Actions

Institutions

Peking University

City University of Hong Kong

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Localized Multi-Agent Reinforcement Learning for Cooperative Management of Supply Chains

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study