Hyperspectral target detection (HTD) aims to identify pixel-level targets within complex backgrounds, but existing HTD methods often fail to fully exploit multi-scale features and integrate global–local information, leading to suboptimal detection performance. To address these challenges, a novel hybrid Transformer–Mamba network (HTMNet) is proposed to reconstruct the high-fidelity background samples for HTD. HTMNet consists of the following two parallel modules: the multi-scale feature extraction (MSFE) module and the global–local feature extraction (GLFE) module. Specifically, in the MSFE module, we designed a multi-scale Transformer to extract and fuse multi-scale background features. In the GLFE module, a global feature extraction (GFE) module is devised to extract global background features by introducing a spectral–spatial attention module in the Transformer. Meanwhile, a local feature extraction (LFE) module is developed to capture local background features by incorporating the designed circular scanning strategy into the LocalMamba. Additionally, a feature interaction fusion (FIF) module is devised to integrate features from multiple perspectives, enhancing the model’s overall representation capability. Experiments show that our method achieves AUC(PF, PD) scores of 99.97%, 99.91%, 99.82%, and 99.64% on four public hyperspectral datasets. These results demonstrate that HTMNet consistently surpasses state-of-the-art HTD methods, delivering superior detection performance in terms of AUC(PF, PD).
Zheng et al. (Sat,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: