Semi-pessimistic Reinforcement Learning | Synapse