Key points are not available for this paper at this time.
I present a new way to parallelize the training of convolutional neural networks across multiple GPUs. The method scales significantly better than all alternatives when applied to modern convolutional neural networks.
Alex Krizhevsky (Wed,) studied this question.