With the advancement of 3D digital dentistry, accurate 3D tooth segmentation has become increasingly important in orthodontics and computer-aided diagnosis. However, existing supervised approaches rely heavily on exhaustive face-wise annotations and often generalize poorly across complex clinical meshes. Although self-supervised learning offers a promising way to reduce annotation costs, current paradigms remain sensitive to data augmentations, learn suboptimal representations under pure masking schemes, and struggle with the complex structural characteristics of dental geometry. To address these limitations, we propose Dental-CMAE, a graph-enhanced hierarchical Contrastive Masked AutoEncoder framework tailored for 3D tooth segmentation. The framework incorporates a dual-branch masking strategy that leverages graph-based structural priors to generate distinct corrupted views while preserving intrinsic mesh topology, thereby facilitating robust reconstruction. This strategy is combined with a feature-level contrastive objective that enforces semantic consistency between co-masked regions, enhancing representation discriminability without requiring negative sample queues. Additionally, the architecture uses a hierarchical multi-scale attention mechanism that partitions feature channels into parallel streams, capturing fine-grained morphological variations and the global dental arch context simultaneously. Extensive experiments demonstrate that Dental-CMAE consistently outperforms state-of-the-art fully supervised and self-supervised methods across multiple evaluation metrics. Specifically, our framework achieves an Overall Accuracy (OA) of 95.57%, a mean Intersection-over-Union (mIoU) of 88.14%, and a mean Accuracy (mAcc) of 90.85%.
These quantitative results support the effectiveness of our method for robust 3D tooth segmentation and highlight its strong potential to alleviate annotation bottlenecks and improve the reliability of automated 3D digital dental workflows.
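The three reported metrics can be made concrete with a short sketch. The abstract does not define OA, mIoU, or mAcc, so the code below assumes their standard definitions over per-face labels: OA is the fraction of correctly labeled faces, mIoU averages the per-class intersection-over-union, and mAcc averages the per-class recall. The function names (`confusion_matrix`, `segmentation_metrics`) and the choice to skip classes absent from the ground truth are illustrative assumptions, not the authors' evaluation code.

```python
from typing import Dict, List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Count faces per (ground-truth class, predicted class) pair."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def segmentation_metrics(y_true: Sequence[int], y_pred: Sequence[int],
                         num_classes: int) -> Dict[str, float]:
    """Return OA, mIoU, and mAcc under the standard definitions (assumed)."""
    cm = confusion_matrix(y_true, y_pred, num_classes)
    total = sum(sum(row) for row in cm)
    correct = sum(cm[c][c] for c in range(num_classes))

    ious, accs = [], []
    for c in range(num_classes):
        tp = cm[c][c]
        gt = sum(cm[c])                                   # faces labeled c
        pred = sum(cm[r][c] for r in range(num_classes))  # faces predicted c
        if gt == 0:
            # Assumption: classes absent from the ground truth are skipped.
            continue
        ious.append(tp / (gt + pred - tp))  # intersection over union
        accs.append(tp / gt)                # per-class accuracy (recall)

    return {
        "OA": correct / total,
        "mIoU": sum(ious) / len(ious),
        "mAcc": sum(accs) / len(accs),
    }


# Toy example with six faces and three classes (hypothetical labels).
metrics = segmentation_metrics([0, 0, 1, 1, 2, 2], [0, 1, 1, 1, 2, 0], 3)
print(metrics)
```

Because mIoU penalizes both missed and spurious faces for each class, it is typically lower than OA on the same predictions, which matches the ordering of the reported figures.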
Li et al. (Mon,) studied this question.