What question did this study set out to answer?

The central aim is to evaluate the scalability and transferability of SegFormer models in urban scene segmentation across diverse datasets.

March 3, 2026Open Access

Evaluating Architecture Scalability and Transfer Learning in Urban Scene Segmentation Using Explainable AI

Key Points

The central aim is to evaluate the scalability and transferability of SegFormer models in urban scene segmentation across diverse datasets.
Analyzed variants of SegFormer (B3, B4, B5) using CamVid dataset.
Performed cross-dataset transfer learning to KITTI and IDD datasets.
Evaluated class-level performance and generated confidence heatmaps for explainability.
SegFormer-B5 achieved the highest accuracy of 82.4% mIoU on CamVid.
Transfer learning from CamVid improved mIoU on KITTI by 2.57%.
Class-specific predictions enhanced in IDD by over 70%.

Abstract

Semantic segmentation plays a pivotal role in autonomous driving, enabling pixel-level understanding of road scenes. Although transformer-based models such as SegFormer have shown exceptional performance on large datasets, their generalization to smaller and geographically diverse datasets remains underexplored. In this work, we analyze the scalability and transferability of SegFormer variants (B3, B4, B5) using CamVid as the base dataset. We perform cross-dataset transfer learning to KITTI and IDD, evaluate class-level performance, and explore explainable AI via confidence heatmaps. Our findings show that SegFormer-B5 achieves the highest accuracy (82.4% mIoU) on CamVid, while transfer learning from CamVid improves mIoU on KITTI by 2.57% and enhances class-specific predictions in IDD by over 70%. These results highlight the practical potential of SegFormer in real-world segmentation systems and the interpretability benefits of confidence-based visual analysis.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Hatkar et al. (Sun,) studied this question.

synapsesocial.com/papers/69a67f12f353c071a6f0af73 https://doi.org/https://doi.org/10.3390/bdcc10030075

Bookmark

View Full Paper