April 23, 2024

Open Access

Reinforcement Learning with Adaptive Control Regularization for Safe Control of Critical Systems

Abstract

Reinforcement Learning (RL) is a powerful method for controlling dynamic systems, but its learning mechanism can lead to unpredictable actions that undermine the safety of critical systems. Here, we propose RL with Adaptive Control Regularization (RL-ACR), which ensures RL safety by combining the RL policy with a control regularizer that hard-codes safety constraints over forecasted system behaviors. Adaptability is achieved through a learnable "focus" weight trained to maximize the cumulative reward of the policy combination. As the RL policy improves through off-policy learning, the focus weight improves on the initially suboptimal strategy by gradually relying more on the RL policy. We demonstrate the effectiveness of RL-ACR in a critical medical control application and further investigate its performance in four classic control environments.
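
The policy combination described in the abstract can be pictured with a minimal sketch. The abstract does not give the combination rule, so the convex blend below is an assumption, as are the names `combine_actions`, `a_rl`, `a_reg`, and `beta`. Here `beta` is taken to be the focus weight placed on the regularizer's action, so that lowering it shifts reliance toward the RL policy.

```python
from typing import List, Sequence


def combine_actions(
    a_rl: Sequence[float], a_reg: Sequence[float], beta: float
) -> List[float]:
    """Blend an RL action with a regularizer action.

    Assumed form (not taken from the paper): a convex combination in
    which `beta` weights the regularizer and `1 - beta` weights the
    RL policy.
    """
    if not 0.0 <= beta <= 1.0:
        raise ValueError("beta must lie in [0, 1]")
    if len(a_rl) != len(a_reg):
        raise ValueError("action dimensions must match")
    # Element-wise convex combination of the two proposed actions.
    return [beta * reg + (1.0 - beta) * rl for rl, reg in zip(a_rl, a_reg)]


# Hypothetical usage: early in training, lean on the regularizer;
# later, a smaller beta hands more control to the RL policy.
early_action = combine_actions([2.0], [4.0], beta=1.0)
later_action = combine_actions([2.0], [4.0], beta=0.5)
```

Under this assumed rule, `beta = 1` reproduces the regularizer's action exactly and `beta = 0` reproduces the RL action, which matches the abstract's description of a strategy that starts from the regularizer and gradually relies more on the learned policy. How the focus weight is actually parameterized and trained is not specified in the abstract.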

Authors

Haozhe Tian

Beihang University

Homayoun Hamedmoghadam

Robert Shorten

Dyson (United Kingdom)
