Observing building changes in remote sensing images plays a crucial role in monitoring urban development and promoting sustainable urbanization. Mainstream change detection methods have demonstrated promising performance in identifying building changes. However, buildings have large intra-class variance and high similarity with other objects, limiting the generalization ability of models in diverse scenarios. Moreover, most existing methods only detect whether changes have occurred but ignore change types, such as new construction and demolition. To address these issues, we present a building change-type detection network (BCTDNet) based on the Segment Anything Model (SAM) to identify newly constructed and demolished buildings. We first construct a dual-feature interaction encoder that employs SAM to extract image features, which are then refined through trainable multi-scale adapters for learning architectural structures and semantic patterns. Moreover, an interactive attention module bridges SAM with a Convolutional Neural Network, enabling seamless interaction between fine-grained structural information and deep semantic features. Furthermore, we develop a change-aware attribute decoder that integrates building semantics into the change detection process via an extraction decoding network. Subsequently, an attribute-aware strategy is adopted to explicitly generate distinct maps for newly constructed and demolished buildings, thereby establishing clear temporal relationships among different change types. To evaluate BCTDNet’s performance, we construct the JINAN-MCD dataset, which covers Jinan’s urban core area over a six-year period, capturing diverse change scenarios. Moreover, we adapt the WHU-CD dataset into WHU-MCD to include multiple types of changing. Experimental results on both datasets demonstrate the superiority of BCTDNet. On JINAN-MCD, BCTDNet achieves improvements of 12.64% in IoU and 11.95% in F1 compared to suboptimal methods. Similarly, on WHU-MCD, it outperforms second-best approaches by 2.71% in IoU and 1.62% in F1. BCTDNet’s effectiveness and robustness in complex urban scenarios highlight its potential for applications in land-use analysis and urban planning.
Zhang et al. (Thu,) studied this question.