June 1, 2020

Dynamic Convolution: Attention Over Convolution Kernels

Key Points

Key points are not available for this paper at this time.

Abstract

Light-weight convolutional neural networks (CNNs) suffer performance degradation as their low computational budgets constrain both the depth (number of convolution layers) and the width (number of channels) of CNNs, resulting in limited representation capability. To address this issue, we present Dynamic Convolution, a new design that increases model complexity without increasing the network depth or width. Instead of using a single convolution kernel per layer, dynamic convolution aggregates multiple parallel convolution kernels dynamically based upon their attentions, which are input dependent. Assembling multiple kernels is not only computationally efficient due to the small kernel size, but also has more representation power since these kernels are aggregated in a non-linear way via attention. By simply using dynamic convolution for the state-of-the-art architecture MobileNetV3-Small, the top-1 accuracy of ImageNet classification is boosted by 2.9% with only 4% additional FLOPs and 2.9 AP gain is achieved on COCO keypoint detection.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Chen et al. (Mon,) studied this question.

synapsesocial.com/papers/69d5706f75589c71d767dbbc — DOI: https://doi.org/10.1109/cvpr42600.2020.01104

Authors

Yinpeng Chen

Zhejiang University

Xiyang Dai

Microsoft (United States)

Mengchen Liu

Microsoft (United States)

Actions

Institutions

Microsoft Research (United Kingdom)

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Dynamic Convolution: Attention Over Convolution Kernels

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion