What question did this study set out to answer?

This research aims to enhance the classification accuracy and efficiency of land feature extraction from high-resolution remote sensing images using a novel Transformer-based algorithm.

June 1, 2026 | Open Access

Automatic Algorithm for Land Feature Classification and Map Feature Extraction of High-Resolution Remote Sensing Imagery Based on Transformer

Key points

The method uses a Transformer model as the core for feature extraction and classification.
The positional encoding is modified to fit the spatial characteristics of remote sensing images, and a multi-scale feature fusion unit is added.
An adaptive loss function is introduced to optimize model training.
In building classification, boundary localization accuracy reached 91.2%, up from 85.6% with Swin Transformer.
Both classification accuracy and extraction efficiency are reported to improve across multiple land cover types.
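
The two building-boundary figures above can be compared in more than one way, and it helps to keep the percentage-point gain apart from the relative gain. The short sketch below only redoes that arithmetic from the two reported accuracies; the function name `compare_accuracy` is a hypothetical helper introduced for illustration and is not part of the study.

```python
def compare_accuracy(baseline_pct, proposed_pct):
    """Return (gain in percentage points, gain relative to the baseline in %)."""
    gain_points = proposed_pct - baseline_pct
    relative_gain_pct = 100.0 * gain_points / baseline_pct
    return gain_points, relative_gain_pct


# Reported boundary localization accuracies: Swin Transformer vs. the proposed method.
gain, relative = compare_accuracy(baseline_pct=85.6, proposed_pct=91.2)
print(f"{gain:.1f} percentage points, {relative:.1f}% relative to the baseline")
```

The percentage-point gain is the plain difference between the two accuracies, while the relative gain divides that difference by the baseline value.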

Abstract

With the rapid development of remote sensing technology, high-resolution remote sensing images have become valuable in natural resource planning, environmental monitoring, fire rescue, and other fields because of their rich spatial detail. However, these images contain diverse land feature types with complex spatial distributions. Traditional algorithms tend to lose detail during feature extraction and adapt poorly to complex scenes, so their land feature classification accuracy and map feature extraction efficiency fall short of practical needs. This paper takes the Transformer model as its core and constructs a three-stage technical framework of "feature enhancement, multi-scale fusion, accurate classification and extraction". First, the positional encoding module of the Transformer is improved to fit the spatial characteristics of remote sensing images. Second, a multi-scale feature fusion unit is designed that combines the local feature extraction of CNNs with the global dependency modeling of the Transformer. Finally, an adaptive loss function is proposed to optimize model training. Experiments were conducted on the publicly available high-resolution remote sensing dataset WHU-SEN-City and a self-built UAV image dataset. The results show that the proposed MSA-ST algorithm has clear advantages in classifying and extracting multiple land cover types. In building classification, its boundary localization accuracy reaches 91.2%, a 5.6 percentage point improvement over Swin Transformer's 85.6%. These results indicate that the algorithm supports efficient land cover identification and feature extraction in complex scenes.
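
The abstract does not say how the positional encoding is changed, so the sketch below shows only the common starting point such work builds on: a fixed 2D sine-cosine encoding that gives every image patch a vector reflecting both its row and its column. This is an assumption made for illustration, not the paper's improved module; the function name `sincos_2d_position_encoding`, the grid size, and the channel count are all hypothetical.

```python
import math


def sincos_2d_position_encoding(height, width, dim):
    """Fixed 2D sine-cosine encoding for a height x width grid of patches.

    Half of the `dim` channels encode the row index and half encode the
    column index, so each patch vector reflects both image axes.
    """
    if dim % 4 != 0:
        raise ValueError("dim must be divisible by 4 (sin and cos for two axes)")
    quarter = dim // 4
    # Geometric frequency schedule, as in the original Transformer encoding.
    freqs = [1.0 / (10000.0 ** (i / quarter)) for i in range(quarter)]

    def encode_axis(position):
        # Interleave sin and cos of the position at each frequency.
        values = []
        for f in freqs:
            values.append(math.sin(position * f))
            values.append(math.cos(position * f))
        return values

    return [
        [encode_axis(row) + encode_axis(col) for col in range(width)]
        for row in range(height)
    ]


# Hypothetical 2 x 3 patch grid with 8 channels per patch.
grid = sincos_2d_position_encoding(height=2, width=3, dim=8)
print(len(grid), len(grid[0]), len(grid[0][0]))
```

Encoding rows and columns separately is one simple way to preserve the two-dimensional layout of image patches, which a 1D sequence index would flatten away.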
