Learning Algorithms for Markov Decision Processes with Average Cost | Synapse