Hyperspectral image (HSI) classification requires the extraction of discriminative features from high-dimensional spatial–spectral data. While the Mamba architecture has shown promise in long-sequence modeling with linear complexity, its application to HSI remains constrained by two major hurdles: the unidirectional causal scanning which fails to capture non-causal global dependencies, and the serialization-induced loss of two-dimensional spatial topology and local textures. To overcome these limitations, we propose HAMamba, a novel Hybrid Attention State Space Model. HAMamba facilitates deep representation learning through two core components: a Multi-Scale Dynamic Fusion (MSDF) module and a Hybrid Attention Mamba Encoder (HAME). Specifically, the MSDF module augments spatial perception through parallelized feature extraction and dynamically weighted integration. The HAME synergizes a Bidirectional Sequence Scan Mamba (BSSM) to establish global semantic context and a Spatial–Spectral Gated Attention (SSGA) module to refine local structural details. Comprehensive experiments on four public benchmark datasets demonstrate that the proposed HAMamba significantly outperforms state-of-the-art approaches, achieving a superior balance between classification accuracy and computational efficiency.
Cheng et al. (Sat,) studied this question.