Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints | Synapse