Key points are not available for this paper at this time.
With the development of communication technology and artificial intelligence, reinforcement learning (RL), as a data-driven control method, has received tremendous attention. The purpose of this survey is to provide an overview of the state-of-the-art policy optimization method for controller design, which is a typical RL method. In particular, we discuss its convergence and sample complexity in certain fundamental optimal control problems in linear systems, such as linear quadratic regulators, output feedback, mathcalH∞control, and distributed control. Additionally, we discuss some future work on the policy optimization for control systems.
Building similarity graph...
Analyzing shared references across papers
Loading...
赵 et al. (Mon,) studied this question.
synapsesocial.com/papers/6a04d799ec645bc7a1368428 — DOI: https://doi.org/10.1360/ssi-2022-0455
斐然 赵
科友 游
Scientia Sinica Informationis
Building similarity graph...
Analyzing shared references across papers
Loading...