What question did this study set out to answer?

The study aims to improve channel pruning techniques to prevent accuracy loss while accelerating CNNs.

April 1, 2026Open Access

Reconstruction and Consolidation Pruning with Feature Reload Mechanism for Efficient Deep CNNs

Key Points

The study aims to improve channel pruning techniques to prevent accuracy loss while accelerating CNNs.
Proposed a pruning framework named Reconstruction and Consolidation Pruning (RCP).
Decoupled the pruning process into a training phase for strategy generation and an inference phase for execution.
Implemented a feature reloading mechanism using 1x1 compensation convolution to adaptively transfer features.
Utilized linear reparameterization to integrate compensation branches into the main network without performance loss.
At 50% FLOPs reduction, RCP led to a 0.84% decrease in accuracy on ResNet-50.
Achieved a 0.07% accuracy improvement at 53% FLOPs reduction for ResNet-56.
Demonstrated effectiveness and superiority of RCP under high compression rates.

Abstract

Channel pruning enables model acceleration by removing channels from convolutional neural networks (CNNs). However, many existing methods adopt a “hard removal” strategy that directly removing low-importance channels, leading to severe feature loss and accuracy degradation. To address this issue, we propose Reconstruction and Consolidation Pruning (RCP), a pruning framework that decouples the pruning process into a pruning-training phase and an inference phase. During pruning training, RCP generates a pruning strategy based on channel importance under a global pruning rate constraint, and constructs a feature reloading mechanism. This mechanism utilizes a learnable 1×1 compensation convolution to adaptively transfer and fuse discriminative features hidden in the pruned channels into the retained channels. In the inference phase, RCP adopts a linear reparameterization strategy to seamlessly consolidate the compensation branches into the main network branch without loss of performance, ensuring zero additional operator overhead during inference. This reversible structural transformation ensures that the training-time augmented architecture and the inference-time compact architecture are functionally identical under linear consolidation. Experimental results show that at 50% FLOPs reduction, RCP incurs only a 0.84% accuracy drop on ResNet-50 (ImageNet-1K), while at 53% FLOPs reduction it achieves a 0.07% accuracy improvement for ResNet-56 (CIFAR-10), validating the proposed method’s effectiveness and superiority under high compression rates.

Reconstruction and Consolidation Pruning with Feature Reload Mechanism for Efficient Deep CNNs

Key Points

Abstract

Cite This Study