Implicit policy constraint for offline reinforcement learning | Synapse