Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning | Synapse