On the different regimes of stochastic gradient descent | Synapse