Airborne LiDAR (Light Detection and Ranging) data is widely used in urban modelling and three-dimensional spatial analysis studies. However, the irregular structure of LiDAR point clouds, varying point densities, and class imbalances observed in the datasets make semantic segmentation problematic. This study addresses the four-class semantic segmentation problem (unclassified, vegetation, ground, and building) on aerial LiDAR point clouds, with a particular focus on multi-class segmentation. The Oregon LiDAR Program dataset was obtained through the OpenTopography platform for use in this study. The point cloud data were resampled to 4096 points to ensure a fixed input size; for each point, the X, Y, and Z coordinates, along with the RGB and intensity features, were utilized. Experimental studies compared the proposed method with both baseline models (PointNet, PointNet++ MSG, and VoxelNet Lite) and recent state-of-the-art architectures, including Point Transformer, KPConv, and RandLA-Net. Additionally, the PointNet2 MSG Transformer model was developed based on the PointNet++ MSG architecture and includes a transformer-based feature fusion module. Different loss functions and training configurations were evaluated, and the effects of ensemble learning and test-time augmentation strategies on model performance were analyzed. The experimental results show that the proposed approach achieved a mean Intersection over Union (IoU) of 51.74% and an accuracy of 61.50% on the test dataset. These results demonstrate that combining multi-scale feature extraction with transformer-based feature fusion is an effective approach for semantic segmentation of LiDAR point clouds and multi-class segmentation tasks.
Sevinç et al. (Wed,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: