On the Stability of Gradient Descent for Large Learning Rate | Synapse