Generalizable Policy Improvement via Reinforcement Sampling (Student Abstract)