June 1, 2019

Do Better ImageNet Models Transfer Better?

Key Points

Key points are not available for this paper at this time.

Abstract

Transfer learning is a cornerstone of computer vision, yet little work has been done to evaluate the relationship between architecture and transfer. An implicit hypothesis in modern computer vision research is that models that perform better on ImageNet necessarily perform better on other vision tasks. However, this hypothesis has never been systematically tested. Here, we compare the performance of 16 classification networks on 12 image classification datasets. We find that, when networks are used as fixed feature extractors or fine-tuned, there is a strong correlation between ImageNet accuracy and transfer accuracy (r = 0.99 and 0.96, respectively). In the former setting, we find that this relationship is very sensitive to the way in which networks are trained on ImageNet; many common forms of regularization slightly improve ImageNet accuracy but yield features that are much worse for transfer learning. Additionally, we find that, on two small fine-grained image classification datasets, pretraining on ImageNet provides minimal benefits, indicating the learned features from ImageNet do not transfer well to fine-grained tasks. Together, our results show that ImageNet architectures generalize well across datasets, but ImageNet features are less general than previously suggested.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Simon Kornblith

Google (United States)

Jonathon Shlens

Google (United States)

Quoc V. Le

Ton Duc Thang University

Actions

Institutions

Google (United States)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Kornblith et al. (Sat,) studied this question.

synapsesocial.com/papers/69da235e1e2c7d7b4fa3c054 — DOI: https://doi.org/10.1109/cvpr.2019.00277

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Decoupled Weight Decay Regularization· 2017 · 9,101 citations
LIBLINEAR: A Library for Large Linear Classification· 2008 · 6,624 citations
Prototypical Networks for Few-shot Learning· 2017 · 5,205 citations
Collecting a Large-scale Dataset of Fine-grained Cars· 2013 · 96 citations
Deeply-Supervised Nets· 2014 · 1,037 citations

Do Better ImageNet Models Transfer Better?

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider