Reinforcement learning from human reward: Discounting in episodic tasks | Synapse