July 17, 2022

Semantic Segmentation of High-Resolution Remote Sensing Images Using an Improved Transformer

Key Points

Key points are not available for this paper at this time.

Abstract

Semantic segmentation has been widely researched for high level analysis of High Spatial Resolution (HSR) remote sensing images, where Convolutional Neural Network (CNN) is the mainstream method. However, the transformer with attention mechanism has its unique capacity of extracting global information which is generally ignored by CNN models. In this paper, a Swin Transformer with UPer head (STUP) is proposed to tackle with semantic segmentation problem on a challenging remote sensing land-cover dataset called LoveDA, which owns complex background samples and inconsistent classes distributions. The proposed STUP combines the Swin Transformer with Uper Head in the form of an encoder-decoder structure, to extract features of HSR images for segmentation. Furthermore, Focal Loss is adopted to handle the unbalanced distribution problem in the training step. Experimental results demonstrate that the proposed STUP clearly outperforms several state-of-the-art models.

AI에게 질문

Bookmark

Cite This Study

Liu et al. (Sun,) studied this question.

synapsesocial.com/papers/69ff9a70581c6e761e777d79 https://doi.org/https://doi.org/10.1109/igarss46834.2022.9884103

Also Consider

Synapse has enriched one closely related paper. Consider it for comparative context:

1Fully convolutional networks for semantic segmentation2015 · 36,909 citations

AI에게 질문

Bookmark