Towards Formalizing Reinforcement Learning Theory | Synapse