SGD with Clipping is Secretly Estimating the Median Gradient | Synapse