In response to the limitations of few-shot point cloud semantic segmentation, including weak local geometric representation, insufficient utilization of global contextual information, and vulnerability of features to noise interference, we propose a few-shot point cloud segmentation network that combines multi-dimensional perception and frequency-domain enhancement. First, we establish a multi-dimensional perception mechanism by designing a Geometry Awareness Module (GAM) that models local geometric manifolds through multi-scale positional encoding and explicit neighborhood difference modeling. Second, a Global Perception Module (GPM) is introduced, utilizing multi-scale adaptive pooling to capture long-range contextual dependencies. This enables feature refinement that spans from local fine-grained structures to the global scene context. Finally, we construct a Frequency Domain Module (FDM) that utilizes the Fourier transform to disentangle the amplitude spectrum from the phase spectrum and adopts an adaptive spectral enhancement strategy to suppress unstable feature perturbations and enhance boundary-sensitive responses, thereby alleviating the limitations of purely spatial-domain feature representation and enabling complementary learning between spatial-domain and frequency-domain features. Experimental results show that the proposed method achieves competitive performance on the S3DIS and ScanNet benchmarks, with mIoU scores of 50.01% and 42.72% under the 1-way 1-shot setting, respectively. Ablation experiments verify that the network effectively improves segmentation accuracy and prototype robustness in data-scarce and complex indoor environments by means of explicit joint geometric and frequency-domain modeling together with global semantic consistency regularization.
Jia et al. (Sat,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: