What question did this study set out to answer?

This research aims to develop an occlusion-aware framework for improving human pose estimation in the presence of occlusions using point clouds.

June 18, 2026

Occlusion-robust human pose estimation with synthetic occlusion in point clouds

Key Points

The study proposes an occlusion-aware framework for human pose estimation from temporal point-cloud sequences.
The network combines PointNet++ for spatial feature extraction with a Transformer for temporal encoding.
A graph convolutional network with an inverse discrete cosine transform (DCT) reconstructs the skeleton from occluded point clouds.
The method was evaluated against an RGB-based baseline (MediaPipe) under real robot-induced occlusions, using 128 annotated frames of right-hand reaching.
The proposed method achieved significantly lower estimation errors than the baseline at the shoulder and elbow.
In an ablation study, occlusion augmentation with synthetic Perlin-noise masks significantly improved performance under occlusion.
After multiple-comparison correction, error-visibility correlations remained for the baseline but not for the proposed method, suggesting reduced sensitivity to occlusion.
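
The inverse DCT step named in the key points can be illustrated with a minimal sketch. It assumes, hypothetically, that each joint coordinate is represented by a few low-frequency DCT coefficients over a window of frames, which the inverse transform turns back into a smooth trajectory. The summary does not state the window length, the number of coefficients, or the normalization, so `n_frames`, the cut-off of 4 coefficients, the orthonormal scaling, and the example trajectory are all illustrative assumptions rather than the authors' settings.

```python
import math


def dct_ortho(signal):
    """Orthonormal DCT-II of a 1-D sequence (time -> frequency coefficients)."""
    n = len(signal)
    coeffs = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        total = sum(x * math.cos(math.pi * (t + 0.5) * k / n)
                    for t, x in enumerate(signal))
        coeffs.append(scale * total)
    return coeffs


def idct_ortho(coeffs, n_frames):
    """Inverse of dct_ortho; coefficients beyond len(coeffs) are taken as zero."""
    frames = []
    for t in range(n_frames):
        value = 0.0
        for k, c in enumerate(coeffs):
            scale = math.sqrt(1.0 / n_frames) if k == 0 else math.sqrt(2.0 / n_frames)
            value += scale * c * math.cos(math.pi * (t + 0.5) * k / n_frames)
        frames.append(value)
    return frames


def rms_error(a, b):
    """Root-mean-square difference between two equally long sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))


# Hypothetical elbow x-coordinate (metres) over 16 frames of a reaching motion.
n_frames = 16
trajectory = [0.30 + 0.10 * math.sin(0.5 * math.pi * t / (n_frames - 1))
              for t in range(n_frames)]

coeffs = dct_ortho(trajectory)
full = idct_ortho(coeffs, n_frames)        # all coefficients: exact round trip
smooth = idct_ortho(coeffs[:4], n_frames)  # 4 low-frequency coefficients only
```

Keeping only low-frequency coefficients constrains the reconstructed motion to be smooth over time, which is one plausible reason to predict in the DCT domain when individual frames are partly occluded.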

Abstract

We propose an occlusion-aware framework for human pose estimation based on temporal point-cloud sequences. Training data are generated via simulation and augmented with synthetic occlusions using Perlin-noise masks. The network combines PointNet++ for spatial feature extraction, a Transformer for temporal encoding, and a graph convolutional network with inverse DCT for skeletal reconstruction. We evaluate the method against an RGB-based baseline (MediaPipe) under real robot-induced occlusions using 128 annotated frames of right-hand reaching. The proposed method achieves significantly lower errors than the baseline at the shoulder and elbow. An ablation study shows that occlusion augmentation significantly improves performance under occlusion. Visibility analysis further indicates that, after multiple-comparison correction, error-visibility correlations remain for the baseline but not for the proposed method, suggesting reduced sensitivity to occlusion. These results demonstrate the potential of simulation-to-real training for robust single-sensor pose estimation in assistive robotics.
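
To make the occlusion-augmentation idea concrete, the following is a minimal sketch of masking a point cloud with 2-D Perlin noise. The abstract does not say how the masks are applied, so everything specific here is an assumption: the noise is evaluated on each point's (x, y) coordinates as a stand-in for an image-plane projection, and the names `make_perlin_2d` and `occlude` as well as the `frequency` and `threshold` values are hypothetical. The point cloud is random filler, not data from the study.

```python
import math
import random


def make_perlin_2d(seed=0, grid=16):
    """Build a 2-D Perlin noise function on a periodic lattice of unit gradients."""
    rng = random.Random(seed)
    grads = {}
    for i in range(grid):
        for j in range(grid):
            angle = rng.uniform(0.0, 2.0 * math.pi)
            grads[(i, j)] = (math.cos(angle), math.sin(angle))

    def fade(t):
        # Perlin's quintic smoothing curve 6t^5 - 15t^4 + 10t^3.
        return t * t * t * (t * (t * 6.0 - 15.0) + 10.0)

    def lerp(a, b, t):
        return a + t * (b - a)

    def noise(x, y):
        x0, y0 = math.floor(x), math.floor(y)

        def corner(ix, iy):
            # Dot product of the lattice gradient with the offset to (x, y).
            gx, gy = grads[(ix % grid, iy % grid)]
            return gx * (x - ix) + gy * (y - iy)

        u, v = fade(x - x0), fade(y - y0)
        top = lerp(corner(x0, y0), corner(x0 + 1, y0), u)
        bottom = lerp(corner(x0, y0 + 1), corner(x0 + 1, y0 + 1), u)
        return lerp(top, bottom, v)

    return noise


def occlude(points, noise, frequency=3.0, threshold=0.1):
    """Keep only points whose (x, y) location has noise at or below the threshold."""
    return [p for p in points
            if noise(p[0] * frequency, p[1] * frequency) <= threshold]


# Random filler cloud in a person-sized box (metres); not real sensor data.
rng = random.Random(1)
points = [(rng.uniform(-0.5, 0.5), rng.uniform(0.0, 1.8), rng.uniform(1.5, 2.5))
          for _ in range(2000)]

noise = make_perlin_2d(seed=0)
visible = occlude(points, noise)
print(len(points), "points before,", len(visible), "after masking")
```

Because Perlin noise varies smoothly in space, thresholding it removes contiguous blobs of points rather than isolated ones, which resembles an object blocking part of the sensor's view more closely than uniform random dropout would. Raising `threshold` removes fewer points, and `frequency` controls the size of the occluded patches.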
