Reinforcement learning is direct adaptive optimal control | Synapse