Dealing with uncertainty: Balancing exploration and exploitation in deep recurrent reinforcement learning | Synapse