Key points are not available for this paper at this time.
We present a simple but powerful architecture of convolutional neural network, which has a VGG-like inference-time body composed of nothing but a stack of 3 × 3 convolution and ReLU, while the training-time model has a multi-branch topology. Such decoupling of the training-time and inference-time architecture is realized by a structural re-parameterization technique so that the model is named RepVGG. On ImageNet, RepVGG reaches over 80% top-1 accuracy, which is the first time for a plain model, to the best of our knowledge. On NVIDIA 1080Ti GPU, RepVGG models run 83% faster than ResNet-50 or 101% faster than ResNet-101 with higher accuracy and show favorable accuracy-speed trade-off compared to the state-of-the-art models like EfficientNet and RegNet. The code and trained models are available at https://github.com/megvii-model/RepVGG.
Building similarity graph...
Analyzing shared references across papers
Loading...
Xiaohan Ding
George Mason University
Xiangyu Zhang
Megvii (China)
Ningning Ma
Chapingo Autonomous University
Tsinghua University
Hong Kong University of Science and Technology
Aberystwyth University
Building similarity graph...
Analyzing shared references across papers
Loading...
Ding et al. (Tue,) studied this question.
synapsesocial.com/papers/69d85fc505ee2ba81dbefa00 — DOI: https://doi.org/10.1109/cvpr46437.2021.01352