March 29, 2024

Optimizing Sepsis Treatment Through Reinforcement Learning: A Revisitation of Reward and Loss Functions in Dueling Double Deep Q-Network

Structured PICO

Does an enhanced DDDQN algorithm with a novel reward function improve survival rates in sepsis treatment compared to clinical strategies?

Population

Sepsis patients from the MIMIC-III dataset

Intervention

Enhanced Dueling Double Deep Q-Network (DDDQN) algorithm with a novel reward function based on clinical treatment recommendations and an optimal loss function

Comparator

Clinical strategies and state-of-the-art value function methods

Outcome

Survival ratehard clinical

An enhanced reinforcement learning algorithm (DDDQN) with a novel reward function may provide more effective treatment strategies for sepsis, potentially improving survival rates compared to standard clinical decisions.

Abstract

With the development of artificial intelligence., an more and more studies are exploring the application of deep reinforcement learning to assist in the clinical treatment of sepsis. Most RL algorithms have shown promising results in formulating treatment decisions on datasets. Previous algorithms based on Dueling Double Deep Q-Network (DDDQN) employed relatively simple reward settings without considering additional patient indicators. Objective: In order to obtain better treatment strategies. Methods: In this study., we enhanced the DDDQN algorithm by introducing a novel reward function based on clinical treatment recommendations and selecting a more optimal loss function. We evaluated our algorithm using state-of-the-art value function methods in expected return., survival rate., and action distribution. Results: The results indicate that DDDQN., with the introduction of the new reward function., achieved higher survival rates in treatment decisions on the MIMIC-III dataset. Conclusion: Reaching up to 93.4%., a notable improvement of approximately 6.3% compared to clinical strategies. Significance: Our approach provides a more effective treatment strategy.

Mark Helpful

Bookmark

Relay