What type of study is this?

This is a Quantitative Study study.

October 19, 2025Open Access

Stochastic Linear Quadratic Optimal Control for Continuous‐Time Systems via Reinforcement Learning

Key Points

The proposed reinforcement learning algorithm effectively computes the optimal control law.
It solves the infinite-horizon stochastic linear quadratic control problem, addressing system noise challenges.
The policy iteration approach utilizes real-time data, enabling online learning and convergence to the desired solution.
Numerical examples validate the effectiveness of the approach in achieving desired control outcomes.

Abstract

ABSTRACT This paper aims at solving the infinite‐horizon stochastic linear quadratic (SLQ) optimal control problem online for continuous‐time systems with both additive and multiplicative noises. To eliminate the requirement for prior knowledge of system dynamics, a novel policy iteration approach is proposed, which leverages integral reinforcement learning (RL) techniques to iteratively solve the stochastic algebraic Riccati equation (SARE) using real‐time state and input data. The proposed approach is an off‐policy RL algorithm, where the learning process can be executed by using identical state and input data collected online over fixed time intervals, thereby enabling the optimal control law to be computed. The convergence of the proposed algorithm to the solution of the SARE is verified, and the effectiveness is validated through a numerical example.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Jianglin Yu

Beihang University

Bing‐Chang Wang

Shandong University

Deyuan Meng

Academy of State Administration of Grain

Journals

International Journal of Robust and Nonlinear Control

Actions

Institutions

Shandong University

Beihang University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Stochastic Linear Quadratic Optimal Control for Continuous‐Time Systems via Reinforcement Learning

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study