June 7, 2020Open Access

Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains

Key Points

Key points are not available for this paper at this time.

Abstract

This paper describes the application of reinforcement learning (RL) to multi-product inventory management in supply chains. The problem description and solution are both adapted from a real-world business solution. The novelty of this problem with respect to supply chain literature is (i) we consider concurrent inventory management of a large number (50 to 1000) of products with shared capacity, (ii) we consider a multi-node supply chain consisting of a warehouse which supplies three stores, (iii) the warehouse, stores, and transportation from warehouse to stores have finite capacities, (iv) warehouse and store replenishment happen at different time scales and with realistic time lags, and (v) demand for products at the stores is stochastic. We describe a novel formulation in a multi-agent (hierarchical) reinforcement learning framework that can be used for parallelised decision-making, and use the advantage actor critic (A2C) algorithm with quantised action spaces to solve the problem. Experiments show that the proposed approach is able to handle a multi-objective reward comprised of maximising product sales and minimising wastage of perishable products.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Sultana et al. (Sun,) studied this question.

synapsesocial.com/papers/6a1bc26700ee29383e9cd661 — DOI: https://doi.org/10.48550/arxiv.2006.04037

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

A Scalable Reinforcement Learning Algorithm for Scheduling Railway Lines· 2018 · 144 citations
Optimal joint replenishment, delivery and inventory management policies for perishable products· 2014 · 204 citations
Logistics and Retail Management: Emerging Issues and New Challenges in the Retail Supply Chain· 2009 · 156 citations
FeUdal Networks for Hierarchical Reinforcement Learning· 2017 · 252 citations
Intelligent supply chain management using adaptive critic learning· 2003 · 49 citations

Authors

Nazneen N. Sultana

Tata Consultancy Services (India)

Hardik Meisheri

Military Hospital

Vinita Baniwal

Tata Consultancy Services (India)

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Also consider