Asymptotically optimal reinforcement learning in Block Markov Decision Processes | Synapse