What type of study is this?

This is a Experimental Study study.

September 24, 2025Open Access

HSSTN: A Hybrid Spectral–Structural Transformer Network for High-Fidelity Pansharpening

Key Points

HSSTN significantly improves the quality of pansharpened images by preserving spectral fidelity and enhancing structural details.
Implemented across multiple datasets, HSSTN demonstrated superior performance in visual quality and quantitative metrics compared to existing methods.
Utilizing a dual-stream feature extractor, HSSTN effectively reduces modality discrepancy between multispectral and panchromatic data.
The novel collaborative optimisation loss function developed in HSSTN ensures high integrity of the spectral information during the fusion process.

Abstract

Pansharpening fuses multispectral (MS) and panchromatic (PAN) remote sensing images to generate outputs with high spatial resolution and spectral fidelity. Nevertheless, conventional methods relying primarily on convolutional neural networks or unimodal fusion strategies frequently fail to bridge the sensor modality gap between MS and PAN data. Consequently, spectral distortion and spatial degradation often occur, limiting high-precision downstream applications. To address these issues, this work proposes a Hybrid Spectral–Structural Transformer Network (HSSTN) that enhances multi-level collaboration through comprehensive modelling of spectral–structural feature complementarity. Specifically, the HSSTN implements a three-tier fusion framework. First, an asymmetric dual-stream feature extractor employs a residual block with channel attention (RBCA) in the MS branch to strengthen spectral representation, while a Transformer architecture in the PAN branch extracts high-frequency spatial details, thereby reducing modality discrepancy at the input stage. Subsequently, a target-driven hierarchical fusion network utilises progressive crossmodal attention across scales, ranging from local textures to multi-scale structures, to enable efficient spectral–structural aggregation. Finally, a novel collaborative optimisation loss function preserves spectral integrity while enhancing structural details. Comprehensive experiments conducted on QuickBird, GaoFen-2, and WorldView-3 datasets demonstrate that HSSTN outperforms existing methods in both quantitative metrics and visual quality. Consequently, the resulting images exhibit sharper details and fewer spectral artefacts, showcasing significant advantages in high-fidelity remote sensing image fusion.

HSSTN: A Hybrid Spectral–Structural Transformer Network for High-Fidelity Pansharpening

Key Points

Abstract

Cite This Study