The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning | Synapse