Key points are not available for this paper at this time.
Problem. Deep Reinforcement Learning (DRL) algorithms are increasingly being used in safety-critical systems. Ensuring the safety of DRL agents is a critical concern in such contexts. However, relying solely on testing is not sufficient to ensure safety as it does not offer guarantees. Building safety monitors is one solution to alleviate this challenge. Existing safety monitoring techniques for regular software systems often rely on formal verification to ensure compliance with safety constraints 4. However, when it comes to DRL policies, formally verifying their behavior to satisfy safety properties becomes an NP-complete problem 6. Further, monitoring DRL agents in a black-box manner is practically important, as testers and safety engineers often do not have full access to the internals nor the training dataset of the DRL agent 2, 8.
Building similarity graph...
Analyzing shared references across papers
Loading...
Amirhossein Zolfagharian
Manel Abdellatif
Damietta University
Lionel Briand
University of Ottawa
University of Ottawa
General Motors (United States)
University of Limerick
Building similarity graph...
Analyzing shared references across papers
Loading...
Zolfagharian et al. (Sun,) studied this question.
synapsesocial.com/papers/68e6f3b7b6db64358766eb43 — DOI: https://doi.org/10.1145/3639478.3643072