Optimistic Posterior Sampling for Reinforcement Learning: Worst-Case Regret Bounds | Synapse