What question did this study set out to answer?

February 2, 2026, Open Access

Extraction of urban building in mountainous areas from Sentinel-2 image

Key Points

The aim is to develop a framework for accurate extraction of building information from remote sensing imagery in mountainous areas.
Utilized Sentinel-2 imagery for building extraction.
Leveraged multi-spectral and terrain data to create building masks.
Fine-tuned the Segment Anything Model (SAM) with generated point prompts for improved accuracy.
Evaluated performance based on F1-score and Intersection over Union (IoU) metrics.
Achieved an F1-score of 82.46% and an IoU of 70.15% on the test dataset from the same region.
Outperformed the original SAM and EfficientSAM by more than 25 and 30 percentage points, respectively.
Maintained robust performance on validation datasets from other regions, with F1-scores above 70% and IoU around 60%.
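
For readers unfamiliar with the two metrics, the following is a minimal sketch of how F1-score and IoU can be computed from a predicted and a reference building mask. It assumes binary masks given as nested lists of 0/1 and metrics aggregated over pixels; the function name `f1_and_iou` and the toy masks are illustrative and do not come from the paper. Under this pixel-level definition the two metrics are linked by F1 = 2·IoU / (1 + IoU), which is consistent with the reported pair (an IoU of 70.15% corresponds to an F1-score of about 82.46%).

```python
def f1_and_iou(pred, truth):
    """Return (F1, IoU) for two same-shaped binary masks (nested lists of 0/1)."""
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1      # building predicted and present
            elif p and not t:
                fp += 1      # building predicted but absent
            elif t and not p:
                fn += 1      # building present but missed
    union = tp + fp + fn
    if union == 0:
        # Convention chosen for this sketch: two empty masks agree perfectly.
        return 1.0, 1.0
    iou = tp / union
    f1 = 2 * tp / (2 * tp + fp + fn)
    return f1, iou


# Toy masks (hypothetical data, for illustration only).
pred = [[1, 1, 0],
        [0, 1, 0]]
truth = [[1, 0, 0],
         [0, 1, 1]]
f1, iou = f1_and_iou(pred, truth)
print(f"F1 = {f1:.4f}, IoU = {iou:.4f}")
```

Because F1 counts the true positives twice, it is always at least as large as IoU for the same prediction, which is why the reported F1-scores sit above the corresponding IoU values.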

Abstract

Accurate extraction of building information from remote sensing imagery is essential for urban planning and management, yet it remains challenging in mountainous regions due to complex terrain, fragmented settlements, and limited annotated data. Existing methods often require extensive manual labeling or struggle to distinguish buildings from vegetation, shadows, and bare land. To address these issues, we propose a framework that leverages multi-spectral and terrain information to automatically generate coarse-grained building masks and corresponding point prompts, which are then used to fine-tune the Segment Anything Model (SAM), originally trained on millions of natural images. This approach enables accurate extraction of urban buildings in mountainous areas of China with minimal manual annotation. On the test dataset from the same region, our method achieves an F1-score of 82.46% and an IoU of 70.15%, outperforming the original SAM and EfficientSAM by more than 25 and 30 percentage points, respectively, and surpassing FCN, UNet, Swin Transformer, and DeepLabV3+ by up to 36 and 41 percentage points. On validation datasets from other regions, the method maintains robust performance with F1-scores above 70% and IoU around 60%, consistently higher than competing baselines. The framework is efficient, easy to deploy, and provides a significant step toward practical large-scale building extraction in complex terrains.
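
The abstract does not say which spectral indices or terrain variables produce the coarse masks, so the following is only a hedged sketch of the general idea, not the authors' method. It assumes, purely for illustration, that a pixel is a building candidate when its NDVI (computed from the Sentinel-2 red and near-infrared bands) is low and its terrain slope is gentle; the thresholds `ndvi_max` and `slope_max`, the function names, and the toy data are all hypothetical. Point prompts are then sampled as (x, y) coordinates with label 1 inside the mask and label 0 outside it, the foreground/background convention used for SAM point prompts.

```python
import random


def coarse_mask(red, nir, slope_deg, ndvi_max=0.2, slope_max=15.0):
    """Flag candidate building pixels: low NDVI and gentle slope (assumed rule)."""
    mask = []
    for red_row, nir_row, slope_row in zip(red, nir, slope_deg):
        row = []
        for r, n, s in zip(red_row, nir_row, slope_row):
            total = n + r
            ndvi = (n - r) / total if total else 0.0
            row.append(1 if ndvi < ndvi_max and s < slope_max else 0)
        mask.append(row)
    return mask


def sample_point_prompts(mask, n_pos=2, n_neg=2, seed=0):
    """Sample (x, y) points: label 1 inside the mask, label 0 outside."""
    rng = random.Random(seed)
    pos = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    neg = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if not v]
    k_pos, k_neg = min(n_pos, len(pos)), min(n_neg, len(neg))
    points = rng.sample(pos, k_pos) + rng.sample(neg, k_neg)
    labels = [1] * k_pos + [0] * k_neg
    return points, labels


# Toy 2x3 reflectance and slope grids (hypothetical values).
red = [[0.30, 0.05, 0.28],
       [0.04, 0.32, 0.29]]
nir = [[0.33, 0.45, 0.30],
       [0.50, 0.35, 0.31]]
slope = [[3.0, 5.0, 30.0],
         [2.0, 4.0, 6.0]]

mask = coarse_mask(red, nir, slope)
points, labels = sample_point_prompts(mask)
print(mask, points, labels)
```

In a pipeline of this kind, the sampled points and labels would accompany each image tile during fine-tuning, which is what allows the coarse masks to stand in for most of the manual annotation.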

Su et al. studied this question.

synapsesocial.com/papers/6980ffb4c1c9540dea812748 https://doi.org/10.1515/geo-2025-0914
