Learning Adversarial MDPs with Stochastic Hard Constraints | Synapse