On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization | Synapse