As a core task in remote sensing image processing, change detection plays a vital role in dynamic surface monitoring for environmental management, urban planning, and agricultural supervision. However, existing methods often suffer from missed detection of small targets and pseudo-change interference, stemming from insufficient modeling of multi-scale feature coupling and spatio-temporal differences due to factors such as background complexity and appearance variations. To this end, we propose a Differential-Perception-Enhanced Multi-Scale Attention Network for Remote Sensing Image Change Detection (MDNet), an optimized framework integrating multi-scale feature extraction, cross-scale aggregation, difference enhancement, and context modeling. Through the parallel collaborative mechanism of the designed Multi-Scale Feature Extraction Module (EMF) and Cross-Scale Adjacent Semantic Information Aggregation Module (CASAM), multi-scale semantic learning is strengthened, enabling fine-grained modeling of change targets of different sizes and improving small-target-detection capability. Meanwhile, the Differential-Perception-Enhanced Module (DPEM) and Transformer structure are introduced for global–local coupled modeling of spatio-temporal differences. They enhance spectral–structural differences to form discriminative features, use self-attention to capture long-range dependencies, and construct multi-level features from local differences to global associations, significantly suppressing pseudo-change interference. Experimental results show that, on three public datasets (LEVIR-CD, WHU-CD, and CLCD), the proposed model exhibits superior detection performance and robustness in terms of quantitative metrics and qualitative analysis compared with existing advanced methods.
Li et al. (Fri,) studied this question.