PAC model-free reinforcement learning | Synapse