What question did this study set out to answer?

The aim is to improve hyperspectral image classification by effectively extracting joint spatial-spectral features.

April 27, 2026Open Access

Hyperspectral image classification network based on multiscale spatial-spectral fusion and semantic enhancement encoder

Key Points

The aim is to improve hyperspectral image classification by effectively extracting joint spatial-spectral features.
Proposed MSNet utilizes multiscale spatial-spectral fusion (MSSF) and a semantic enhancement encoder (SEE).
Principal component analysis reduces dimensionality of hyperspectral images to preserve spectral information.
Feature fusion allows interaction between parallel spatial and spectral branches for comprehensive feature extraction.
Overall accuracy of MSNet reaches 95.68% on Pavia University dataset and 96.84% on Salinas dataset, outperforming existing methods.

Abstract

Abstract Hyperspectral image classification has become a pivotal task in remote sensing data processing. However, current deep learning-based methods still face challenges in effectively extracting discriminative joint spatial-spectral features, especially when handling complex spatial structures and highly variable spectral responses. To address this issue, a novel hyperspectral image classification network (MSNet) based on multiscale spatial-spectral fusion (MSSF) and semantic enhancement encoder (SEE) is proposed. First, principal component analysis is employed to reduce the dimensionality of hyperspectral image, preserving essential spectral information while mitigating noise interference. Second, the proposed MSSF extracts multiscale spatial features through a dedicated spatial branch while simultaneously mining discriminative spectral information via a parallel spectral branch. Feature fusion is subsequently employed to achieve synergistic interaction between spatial and spectral representations, effectively capturing joint characteristics of land covers across multiple scales. Third, the designed SEE incorporates a multi-head attention mechanism to explicitly model global feature dependencies, thereby enhancing the representation of semantically critical regions. Extensive experiments have been conducted on the Pavia University and Salinas datasets. Experimental results demonstrate that MSNet achieves outstanding performance, with overall accuracy reaching 95.68% and 96.84%, respectively, surpassing existing mainstream methods and validating its effectiveness.

Bookmark

View Full Paper

Cite This Study

Yu et al. (Sat,) studied this question.

synapsesocial.com/papers/69eefd9bfede9185760d4580 https://doi.org/https://doi.org/10.1038/s41598-026-46407-y

Bookmark

View Full Paper