What question did this study set out to answer?

The aim is to improve image super-resolution by reducing feature loss while maintaining model simplicity.

April 25, 2026Open Access

WHANet: Weighted Hierarchical Attention for Lightweight SR

Key Points

The aim is to improve image super-resolution by reducing feature loss while maintaining model simplicity.
Proposed a hierarchical aggregation model (WHA) with learned weighting for feature extraction.
Introduced the Global Residual Self-Attention Block (GRSAB) to capture global information.
Conducted extensive comparative experiments and ablation studies to validate WHA's efficiency.
Achieved a PSNR score of 26.88dB in the ×4 super-resolution task on the Urban100 benchmark.
Showed significant improvement in capturing both local and global features compared to traditional methods.

Abstract

Image super-resolution (SR) is a computer vision task that reconstructs high-resolution (HR) images from low-resolution (LR) images using algorithms. Current Transformer architectures typically employ modules in a sequential arrangement, which often results in the loss of significant effective features across certain layers. Additionally, the Transformer-based models extract feature information through local windows, but they are prone to losing global features. To prevent the loss of valuable features without increasing model complexity, we propose a novel model, Weighted Hierarchy Aggregation(WHA). The core of our WHA is the Hierarchy Aggregation Block (HAB), which builds upon the techniques of residual and dense connections by introducing a learned weighting mechanism that adaptively emphasizes the most valuable features from each layer in the final output. Furthermore, to address the issue that local window self-attention tends to lose global features, we propose a Global Residual Self-Attention Block(GRSAB). The GRSAB can effectively solve the problem of global information loss by introducing two-dimensional channel attention. Our WHA not only accelerates network training but also effectively captures both local and global features, significantly enhancing super-resolution performance. Through extensive comparative experiments and ablation studies, our results demonstrate the high efficiency of HAB, especially in lightweight SR tasks. In the ×4 super-resolution task, our model achieves a 26.88dB PSNR score on the Urban100 benchmark dataset, demonstrating its efficacy through comparative analysis with other models sharing similar architectural frameworks.

Bookmark

View Full Paper