Off Policy Lyapunov Stability in Reinforcement Learning | Synapse