What type of study is this?

September 10, 2025Open Access

SCCNet: Siamese Networks for Selective Change Captioning in Bi-Temporal Remote Sensing Images

Puntos clave

The proposed method improves accuracy in detecting changes between bi-temporal images, enhancing RSICC tasks.
Performance metrics reveal superior outcomes compared to state-of-the-art techniques in public datasets.
Siamese networks facilitate detailed feature fusion across temporal sequences for reliable change detection.
This framework addresses instability in classification outcomes encountered in previous models, resulting in more reliable interpretations.

Resumen

Abstract Remote Sensing Image Change Captioning (RSICC) is a burgeoning task that aims to articulate change scenarios in bi-temporal remote sensing images using natural languages. The existing methods effectively capture feature differences between bi-temporal remote sensing images and realistic language decoders for accurate interpretation. Notably, not all regions exhibit changes in bi-temporal images, and the presence/absence of changes inherently imposes distinct difficulty levels on the RSICC tasks. Although several existing approaches have discussed this issue, they frequently exhibit the problem of unstable classification outcomes and feature loss during spatiotemporal joint modeling. This paper optimizes the classifier and implements a siamese network and dual-temporal image features fusion module, correlating spatial structures across temporal sequences comprehensively. The proposed framework enables efficient and reliable detection of changed bi-temporal image pairs and generates precise textual descriptions of the identified alterations. The proposed method achieves superior performance on public datasets compared to state-of-the-art methods.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo