What question did this study set out to answer?

This research aims to improve semantic segmentation accuracy in urban driving scenarios by enhancing edge representation and local context.

June 14, 2026

Local feature-aware and edge-enhanced semantic segmentation for autonomous driving

Key Points

Proposed a model that enhances edge feature representation and incorporates local spatial context.
Introduced Multi-scale Central Difference Convolution (MS-CDC) for fusing multi-scale edge features.
Designed a Local Feature Extraction (LFE) module to capture pixel-wise relationships and extract finer-grained pixel features.
Achieved 80.67% mIoU on the Cityscapes validation set and 45.5% mIoU on the Mapillary Vistas validation set.
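
For readers unfamiliar with the metric, mean Intersection over Union (mIoU) averages the per-class ratio of correctly labeled pixels to the union of predicted and ground-truth pixels for that class. The sketch below is a minimal illustration in plain Python, not the authors' evaluation code. The function name `mean_iou`, the flat label lists, and the `ignore_index=255` default are assumptions made for this example.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over classes that occur in the prediction or the ground truth.

    pred and target are flat sequences of integer class labels of equal length.
    ignore_index is an assumed "void" label that is skipped during scoring.
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # unlabeled pixel: excluded from the score
        if p == t:
            # True positive: counts once toward intersection and union of class t.
            intersection[t] += 1
            union[t] += 1
        else:
            # False positive for class p and false negative for class t.
            union[p] += 1
            union[t] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    # Class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is about 0.583.
    print(mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2))
```

Benchmark scores such as those above are normally accumulated over every pixel of the validation set before the per-class ratios are taken, rather than averaged image by image.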

Abstract

Semantic segmentation of urban scenes is an important task in computer vision. However, urban road scenes still present many challenges, such as class imbalance and complex backgrounds. These problems lead to unclear edge segmentation and inaccurate classification of occluded objects in existing semantic segmentation methods for urban scenes, which limits their accuracy and robustness in practical applications. In this paper, we propose a model that recursively enhances edge feature representation while incorporating local spatial context. To address the problem of unclear edge segmentation, we introduce Multi-scale Central Difference Convolution (MS-CDC) to fuse multi-scale edge features. The feature pyramid-based FeedBack Connection (FBC) module fuses multi-scale features while recursively enhancing the original network, thereby improving the robustness of the model to occluded objects. Meanwhile, we design a Local Feature Extraction (LFE) module that captures pixel-wise relationships by constructing local pixel graphs and center pixel graphs, learning local contextual information to extract finer-grained pixel features. Experimental results on the Cityscapes and Mapillary Vistas datasets validate the effectiveness of the proposed model. Our model achieves 80.67% and 45.5% mIoU on the validation sets of Cityscapes and Mapillary Vistas, respectively. We open-source our code at https://github.com/sanmanaa/segmentation-autodriving-graph-centralconv.
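
The abstract does not spell out how MS-CDC is built, so the following is only a rough sketch of the underlying idea, not the authors' implementation. It uses the commonly cited form of central difference convolution, in which a vanilla convolution is blended with a convolution over differences to the center pixel, controlled by a weight `theta`. The function names, the zero padding, the 3x3 kernel, and in particular the fusion step (averaging responses over several dilation rates) are assumptions made for illustration.

```python
def central_difference_conv(image, kernel, theta=0.7, dilation=1):
    """3x3 central difference convolution on a 2D list, with zero padding.

    output = vanilla_conv - theta * center_pixel * sum(kernel), which equals
    theta * (conv over differences to the center) + (1 - theta) * vanilla_conv.
    """
    height, width = len(image), len(image[0])
    kernel_sum = sum(sum(row) for row in kernel)
    output = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            vanilla = 0.0
            for i in range(3):
                for j in range(3):
                    rr = r + (i - 1) * dilation
                    cc = c + (j - 1) * dilation
                    # Out-of-bounds neighbors contribute zero (zero padding).
                    if 0 <= rr < height and 0 <= cc < width:
                        vanilla += kernel[i][j] * image[rr][cc]
            output[r][c] = vanilla - theta * image[r][c] * kernel_sum
    return output


def multi_scale_cdc(image, kernel, theta=0.7, dilations=(1, 2)):
    """Hypothetical multi-scale fusion: average CDC responses over dilations."""
    responses = [central_difference_conv(image, kernel, theta, d) for d in dilations]
    height, width = len(image), len(image[0])
    return [
        [sum(resp[r][c] for resp in responses) / len(responses) for c in range(width)]
        for r in range(height)
    ]


if __name__ == "__main__":
    ones = [[1.0] * 3 for _ in range(3)]
    step_edge = [[0.0, 0.0, 1.0, 1.0, 1.0] for _ in range(5)]
    # With theta=1 the response vanishes in flat regions and is nonzero next to the edge.
    for row in central_difference_conv(step_edge, ones, theta=1.0):
        print(row)
```

With `theta=1` the operator responds only to local intensity changes, which is why it suits edge features. With `theta=0` it reduces to an ordinary convolution.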
