Reinforcement learning by reward-weighted regression for operational space control | Synapse