Key points are not available for this paper at this time.
A set of neural networks is employed to develop control policies that are better than fixed, theoretically optimal policies, when applied to a combined physical inventory and distribution system in a nonstationary demand environment. Specifically, we show that model-based adaptive critic approximate dynamic programming techniques can be used with systems characterized by discrete valued states and controls. The control policies embodied by the trained neural networks outperformed the best, fixed policies (found by either linear programming or genetic algorithms) in a high-penalty cost environment with time-varying demand.
Building similarity graph...
Analyzing shared references across papers
Loading...
Stephen Shervais
T.T. Shannon
George G. Lendaris
IEEE Transactions on Systems Man and Cybernetics - Part A Systems and Humans
Portland State University
Eastern Washington University
Building similarity graph...
Analyzing shared references across papers
Loading...
Shervais et al. (Sat,) studied this question.
synapsesocial.com/papers/6a21f696505988242b492d33 — DOI: https://doi.org/10.1109/tsmca.2003.809214