January 24, 2026

Stereo Image Super-Resolution with Adaptive Multi-Scale Cross-Attention

Key Points

The central aim is to improve stereo image super-resolution, particularly in weakly textured regions.
Proposed the Adaptive Multi-Scale Cross-Attention Stereo Image Super-Resolution Network (AMCASSR).
Implemented the Adaptive Multi-Scale Cross-Attention (AMSCA) module to expand the receptive field and adaptively fuse multi-scale features.
Utilized the Multi-Scale Cross-Attention Feature Block (MSCFB) to integrate intra-view feature learning with cross-view interaction.
Evaluated the model on the KITTI2012, KITTI2015, Middlebury, and Flickr1024 datasets.
AMCASSR significantly improved PSNR and SSIM metrics compared to current methods.
Achieved better performance in weak-textured regions.
Validation on downstream feature matching and stereo matching tasks supported its practical applicability.
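
To make the idea of cross-view interaction concrete, here is a minimal sketch of generic row-wise cross-attention between the two views of a rectified stereo pair. It is not the AMSCA module or the MSCFB: there is no multi-scale branch and there are no learned projections. The function names, tensor layout (height, width, channels), and the residual fusion step are all assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def row_cross_attention(left, right):
    """Generic cross-view attention for rectified stereo features.

    left, right: arrays of shape (H, W, C). In a rectified pair,
    corresponding pixels lie on the same row, so each left pixel
    attends only to right pixels in its own row.
    Returns the fused left features (H, W, C) and the attention
    maps (H, W, W). Hypothetical helper, not the paper's module.
    """
    h, w, c = left.shape
    # Similarity between every left/right pixel pair within a row.
    scores = np.einsum("hic,hjc->hij", left, right) / np.sqrt(c)
    attn = softmax(scores, axis=-1)
    # Gather right-view features for each left pixel.
    warped = np.einsum("hij,hjc->hic", attn, right)
    # Assumed fusion: simple residual addition.
    return left + warped, attn


rng = np.random.default_rng(0)
left_feat = rng.standard_normal((4, 6, 8))
right_feat = rng.standard_normal((4, 6, 8))
fused, attn = row_cross_attention(left_feat, right_feat)
```

Restricting attention to a single row keeps the attention map at W x W per row rather than (H*W) x (H*W), which is one common way stereo methods keep cross-view interaction affordable.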

Abstract

Stereo image super-resolution aims to reconstruct high-resolution images from low-resolution stereo pairs by leveraging complementary information between binocular views, which is essential for a wide range of computer vision applications. To address the limitations in cross-view feature matching of existing methods, particularly in weak-textured regions, we propose the Adaptive Multi-Scale Cross-Attention Stereo Image Super-Resolution Network (AMCASSR). The network comprises two principal modules: the Adaptive Multi-Scale Cross-Attention (AMSCA) module, which enhances reconstruction performance in weak-textured regions by expanding the receptive field and adaptively fusing multi-scale features; and the Multi-Scale Cross-Attention Feature Block (MSCFB), which facilitates the integration of intra-view feature learning and cross-view interaction. Additionally, the network optimizes cross-view interaction while maintaining computational efficiency. Experimental evaluations on the KITTI2012, KITTI2015, Middlebury, and Flickr1024 datasets show that AMCASSR achieves significant improvements in both PSNR and SSIM metrics over current state-of-the-art methods, especially in weak-textured regions. Validation on downstream tasks further supports its practical applicability in feature and stereo matching.
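
For readers unfamiliar with the PSNR metric cited above, the sketch below shows its standard definition, 10 * log10(MAX^2 / MSE). The function names and the 8-bit peak value of 255 are assumptions, and averaging the score over the left and right views is one possible convention for stereo pairs; the abstract does not state the exact evaluation protocol used.

```python
import numpy as np


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two same-shaped images."""
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        # Identical images: PSNR is unbounded.
        return float("inf")
    return 10.0 * np.log10(max_value ** 2 / mse)


def stereo_psnr(ref_left, ref_right, est_left, est_right, max_value=255.0):
    """Assumed convention: mean of the left-view and right-view PSNR."""
    left_score = psnr(ref_left, est_left, max_value)
    right_score = psnr(ref_right, est_right, max_value)
    return 0.5 * (left_score + right_score)
```

Higher values indicate a reconstruction closer to the ground truth; SSIM complements PSNR by comparing local structure rather than per-pixel error.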
