Thompson Sampling for Infinite-Horizon Discounted Decision Processes | Synapse