Reinforcement Learning in Medical Settings —A Review of Counterfactual Reward Estimation Methods Based on Causal Graphs | Synapse