What does this research mean for the field?

MDGroup achieves state-of-the-art performance among grouping-based methods for 3D point cloud instance segmentation, particularly on small objects and complex boundaries. Novelty: novel finding. Consensus alignment: supports consensus.

What question did this study set out to answer?

This research aims to enhance instance segmentation of 3D point clouds by addressing challenges related to geometry and boundaries.

February 26, 2026 | Open Access

MDGroup: Multi-Grained Dual-Aware Grouping for 3D Point Cloud Instance Segmentation

Key points

Developed the Multi-grained Dual-aware Grouping algorithm (MDGroup) for instance segmentation.
Utilized a Dual-Resolution 3D U-Net (DRNet) to preserve local geometric detail while aggregating global semantics.
Implemented a four-branch prediction scheme that adds boundary and directional cues to semantic and offset estimation.
Employed Hierarchical Adaptive Multi-grained Feature fusion (HAMF) for efficient cross-scale alignment.
Introduced a Temporal Adaptive Gating (TAG) module to extend the model to dynamic scenes.
MDGroup achieves state-of-the-art performance among grouping-based methods on the ScanNet v2, S3DIS, STPLS3D, SemanticKITTI, LiDAR-Net, and OCID benchmarks.
The gains are most notable on small objects and complex boundaries.
It also performs well in dynamic environments relative to existing grouping-based methods.
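
The temporal gating item in the list above can be pictured with a generic gate that blends current-frame and previous-frame features. The sketch below is illustrative only: the function name `gated_fuse`, the scalar gate parameters (`w_cur`, `w_hist`, `bias`), and the sigmoid form are assumptions made for this example, not details taken from the paper, and the actual TAG design is not described on this page.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_fuse(current, history, w_cur=1.0, w_hist=-1.0, bias=0.0):
    """Blend two equal-length feature lists element by element.

    Hypothetical gate: a value near 1 keeps the current-frame feature,
    a value near 0 falls back to the previous-frame feature. In a real
    model the gate parameters would be learned; here they are fixed.
    """
    fused = []
    for c, h in zip(current, history):
        gate = sigmoid(w_cur * c + w_hist * h + bias)
        fused.append(gate * c + (1.0 - gate) * h)
    return fused


# Toy per-point features from two consecutive frames (made-up numbers).
current_frame = [0.9, 0.2, 0.5]
previous_frame = [0.7, 0.4, 0.5]
print(gated_fuse(current_frame, previous_frame))
```

Because the output is a convex combination, each fused value always lies between the current and historical value, which is the property that makes a gate a safe way to mix in past frames.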

Abstract

Instance segmentation of 3D point clouds is a fundamental task for scene understanding in applications such as autonomous driving, robotics, and augmented reality. The inherent irregularity and sparsity of point clouds, compounded by scale variations and instance adhesion, pose significant challenges to accurate segmentation. Existing grouping-based methods are often limited by the loss of geometric details in single-path backbones and by error propagation near complex boundaries. To address these issues, a Multi-grained Dual-aware Grouping algorithm (MDGroup) is proposed, which explicitly integrates multi-grained feature representation with dual awareness of class and boundary. The algorithm features a Dual-Resolution 3D U-Net (DRNet) that preserves local geometric details while aggregating global semantics through adaptive alignment. A four-branch prediction scheme enhances semantic and offset estimation with boundary and directional cues, enabling fine-grained boundary modeling. Furthermore, a Hierarchical Adaptive Multi-grained Feature fusion framework (HAMF) achieves efficient cross-scale alignment by combining Class-Aware Dynamic Voxelization and Class-Aware Pyramid Scaling. Finally, a Boundary-Aware Weighted Aggregation mechanism (BAWA) refines instance grouping by dynamically weighting semantic confidence, geometric distance, boundary probability, and directional consistency. To extend the model to dynamic scenes, a Temporal Adaptive Gating (TAG) module is introduced to leverage historical frame correlations. Extensive experiments on the ScanNet v2, S3DIS, STPLS3D, SemanticKITTI, LiDAR-Net, and OCID benchmarks demonstrate that MDGroup achieves state-of-the-art performance among grouping-based methods, particularly on small objects, complex boundaries, and dynamic environments.
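
To make the idea of weighting several cues during grouping more concrete, the sketch below combines the four quantities named in the abstract (semantic confidence, geometric distance, boundary probability, directional consistency) into a single merge score. It is a minimal illustration under stated assumptions, not the BAWA mechanism itself: the function name `merge_score`, the linear form, the fixed weights, and the `radius` value are all hypothetical, and the abstract describes the real weighting as dynamic rather than fixed.

```python
def merge_score(sem_conf, distance, boundary_prob, dir_cos,
                weights=(0.4, 0.3, 0.2, 0.1), radius=0.05):
    """Return a score in [0, 1] for merging a candidate point into a group.

    sem_conf      : semantic confidence of the candidate, in [0, 1]
    distance      : distance from the candidate to the group, same unit as radius
    boundary_prob : predicted probability that the candidate is on a boundary
    dir_cos       : cosine between the candidate's and the group's direction cue
    The weights and radius are hypothetical constants chosen for illustration.
    """
    w_sem, w_geo, w_bnd, w_dir = weights
    geo = max(0.0, 1.0 - distance / radius)  # 1 at zero distance, 0 at or beyond radius
    interior = 1.0 - boundary_prob           # boundary points get a lower score
    direction = 0.5 * (dir_cos + 1.0)        # map cosine from [-1, 1] to [0, 1]
    return w_sem * sem_conf + w_geo * geo + w_bnd * interior + w_dir * direction


# Two candidates that differ only in boundary probability (made-up numbers).
interior_point = merge_score(0.9, 0.01, 0.1, 0.8)
boundary_point = merge_score(0.9, 0.01, 0.9, 0.8)
print(interior_point > boundary_point)
```

With everything else equal, the point that is likely to sit on a boundary receives the lower score, so a grouping step that thresholds this score would be more reluctant to merge across instance boundaries.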
