Human Strategy Adaptation in Reinforcement Learning Resembles Policy Gradient Ascent | Synapse