From free energy to expected energy: Improving energy-based value function approximation in reinforcement learning | Synapse