Optimizing Sepsis Treatment Through Reinforcement Learning: A Revisitation of Reward and Loss Functions in Dueling Double Deep Q-Network | Synapse