May 28, 2024Open Access

Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

Empowering safe exploration of reinforcement learning (RL) agents during training is a critical impediment towards deploying RL agents in many real-world scenarios. Training RL agents in unknown, black-box environments poses an even greater safety risk when prior knowledge of the domain/task is unavailable. We introduce ADVICE (Adaptive Shielding with a Contrastive Autoencoder), a novel post-shielding technique that distinguishes safe and unsafe features of state-action pairs during training, thus protecting the RL agent from executing actions that yield potentially hazardous outcomes. Our comprehensive experimental evaluation against state-of-the-art safe RL exploration techniques demonstrates how ADVICE can significantly reduce safety violations during training while maintaining a competitive outcome reward.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Daniel Bethell

University of York

Simos Gerasimou

Cyprus University of Technology

Radu Călinescu

University of York

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study