What question did this study set out to answer?

The central aim is to enhance change detection in remote sensing images by leveraging multi-scale features.

February 6, 2026Open Access

A Multi-Scale Remote Sensing Image Change Detection Network Based on Vision Foundation Model

Key Points

The central aim is to enhance change detection in remote sensing images by leveraging multi-scale features.
Developed a multi-scale change detection network, SAM-MSCD, based on a vision foundation model.
Implemented a parameter fine-tuning strategy using Low-Rank Adaptation for improved model efficiency.
Created a bi-temporal feature interaction module to model change relationships between images from different times.
Integrated a change feature enhancement module to highlight differences across scales.
SAM-MSCD significantly improves change perception accuracy compared to existing methods.
Outperformed state-of-the-art (SOTA) techniques on key metrics like F1-score and Intersection over Union (IoU).
Achieved better performance on four public remote sensing change detection datasets.

Abstract

As a key technology in the intelligent interpretation of remote sensing, remote sensing image change detection aims to automatically identify surface changes from images of the same area acquired at different times. Although vision foundation models have demonstrated outstanding capabilities in image feature representation, their inherent patch-based processing and global attention mechanisms limit their effectiveness in perceiving multi-scale targets. To address this, we propose a multi-scale remote sensing image change detection network based on a vision foundation model, termed SAM-MSCD. This network integrates an efficient parameter fine-tuning strategy with a cross-temporal multi-scale feature fusion mechanism, significantly improving change perception accuracy in complex scenarios. Specifically, the Low-Rank Adaptation mechanism is adopted for parameter-efficient fine-tuning of the Segment Anything Model (SAM) image encoder, adapting it for the remote sensing change detection task. A bi-temporal feature interaction module(BIM) is designed to enhance the semantic alignment and the modeling of change relationships between feature maps from different time phases. Furthermore, a change feature enhancement module (CFEM) is proposed to fuse and highlight differential information from different levels, achieving precise capture of multi-scale changes. Comprehensive experimental results on four public remote sensing change detection datasets, namely LEVIR-CD, WHU-CD, NJDS, and MSRS-CD, demonstrate that SAM-MSCD surpasses current state-of-the-art (SOTA) methods on several key evaluation metrics, including the F1-score and Intersection over Union(IoU), indicating its broad prospects for practical application.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper