Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination | Synapse