Optimal rates of convergence for entropy regularization in discounted Markov decision processes | Synapse