September 12, 2022Open Access

ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

Key Points

Key points are not available for this paper at this time.

Abstract

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly good results. We share pre-trained models and our code based on the Timm library.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Hugo Touvron

BC Platforms (Finland)

Piotr Bojanowski

Centre National de la Recherche Scientifique

Mathilde Caron

Google (United States)

Journals

IEEE Transactions on Pattern Analysis and Machine Intelligence

Actions

Institutions

Sorbonne Université

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study