Pulse Journal Club Active Debates Trending Explore Researchers

Download the App

Join discussions, follow papers, and never miss your next session.

Download on theApp Store

© Synapse Social LLC, 2026

Home Explore Journal Club Trending

⌘+K

Extremely Large Minibatch SGD: Training ResNet-50 on ImageNet in 15 Minutes | Synapse

November 12, 2017Open Access

Extremely Large Minibatch SGD: Training ResNet-50 on ImageNet in 15 Minutes

Key Points

Key points are not available for this paper at this time.

Abstract

We demonstrate that training ResNet-50 on ImageNet for 90 epochs can be achieved in 15 minutes with 1024 Tesla P100 GPUs. This was made possible by using a large minibatch size of 32k. To maintain accuracy with this large minibatch size, we employed several techniques such as RMSprop warm-up, batch normalization without moving averages, and a slow-start learning rate schedule. This paper also describes the details of the hardware and software of the system used to achieve the above performance.

Mark Helpful

Bookmark

Relay

View Full Paper

Cite This Study

Akiba et al. (Sun,) studied this question.

synapsesocial.com/papers/6a0a9bae36657de66c73762a https://doi.org/https://doi.org/10.48550/arxiv.1711.04325

Mark Helpful

Bookmark

Relay

View Full Paper