What question did this study set out to answer?

The research aims to develop a hybrid framework that combines deep reinforcement learning and traditional control methods for safer and more adaptive continuous control.

February 20, 2026Open Access

Specialized Deep Residual Policy Reinforcement Learning Framework for Safe and Adaptive Continuous Control

Key Points

The research aims to develop a hybrid framework that combines deep reinforcement learning and traditional control methods for safer and more adaptive continuous control.
Integrated residual policy learning and cycle of learning approaches
Utilized a specialized reinforcement learning agent for critical states
Validated framework on the Tennessee Eastman process with various analyses
Improved learning efficiency through collaboration between DRL and traditional controllers
Successful synchronization and activation mechanisms observed in experiments
Ablation study demonstrated enhanced policy learning outcomes

Abstract

ABSTRACT Traditional controllers have limitations as they rely on prior knowledge about the physics of the problem, require modelling of dynamics, and struggle to adapt to abnormal situations. Deep reinforcement learning (DRL) offers a promising alternative by learning policies through exploration, but its black‐box nature and reliance on random exploration pose challenges in safety‐critical environments. Recognizing that conventional controllers and DRL have complementary strengths, we propose a novel hybrid framework to overcome challenges in both conventional control systems and DRL. This framework integrates residual policy learning, a cycle of learning approach, and a specialized reinforcement learning agent for safety‐critical, continuous control. Residual policy learning enables collaboration between DRL and conventional controllers, the cycle of learning improves learning efficiency by leveraging expert trajectories, and a specialized reinforcement learning agent optimizes policy learning in critical states using an input–output hidden Markov model. The framework is validated on the Tennessee Eastman process through experiments that analyse synchronization, activation mechanisms and an ablation study.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Ammar N. Abbas

Georgios C. Chasparis

John D. Kelleher

Journals

IET Control Theory and Applications

Actions

Institutions

Trinity College Dublin

Technological University Dublin

Software Competence Center Hagenberg (Austria)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Specialized Deep Residual Policy Reinforcement Learning Framework for Safe and Adaptive Continuous Control

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study