Convergence rates for gradient descent in the training of overparameterized artificial neural networks with piecewise affine activation | Synapse