What question did this study set out to answer?

The aim is to explore how humans can serve as safety constraints in reinforcement learning for critical applications.

January 25, 2026Open Access

Humans as Safety Constraints: A Survey of Human-in-the-Loop Reinforcement Learning for Critical Systems

Key Points

The aim is to explore how humans can serve as safety constraints in reinforcement learning for critical applications.
Conducted a systematic PRISMA-based review of 100 studies from 2010–2025
Analyzed traditional vs. human-in-the-loop reinforcement learning approaches
Introduced the Human Safety Constraint Framework (HSCF)
Identified gaps in traditional algorithmic safety approaches
Illustrated case studies showing how human intervention reduces risks
Provided recommendations for building scalable, certifiable safety architectures

Abstract

This preprint surveys the role of humans as explicit safety constraints in reinforcement learning (RL) for safety-critical systems. Unlike traditional human-in-the-loop RL approaches that focus on learning efficiency, this work emphasizes human oversight to prevent catastrophic outcomes in domains such as autonomous driving, medical robotics, and industrial control. Using a systematic PRISMA-based review of 100 studies from 2010–2025, the article identifies gaps in purely algorithmic safety approaches and introduces the Human Safety Constraint Framework (HSCF), which formalizes human roles as preventive, corrective, advisory, and normative constraints. Case studies illustrate how human intervention mitigates residual risks, and the survey concludes with recommendations for developing scalable, certifiable hybrid human-algorithm safety architectures.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Kenneth Besigomwe (Fri,) studied this question.

synapsesocial.com/papers/6975b2eafeba4585c2d6e575 https://doi.org/https://doi.org/10.5281/zenodo.18354867

Bookmark

View Full Paper