Key points are not available for this paper at this time.
In this paper distributed adaptive linear quadratic control of discrete-time linear large-scale systems with unknown dynamics using distributed reinforcement learning is studied. Linear quadratic control based on dynamic programming (specifically policy iteration) and adaptive linear quadratic control based on reinforcement learning (especially Q learning) are reviewed first. Then distributed adaptive linear quadratic control is addressed. Two Q functions exploiting the quadratic structure of the value function and leading to a decentralized and a distributed policy are proposed and a decentralized as well as a distributed Q learning algorithm are presented. Finally the concepts are evaluated in a simulation study. The simulation results indicate that the distributed policy is near-optimal.
Daniel Görges (Tue,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: