June 3, 2026 | Open Access

Human‐in‐the‐Loop Object Segmentation for 3D Gaussian Splatting via Finger‐based VR Interface

Key Points

Aimed to improve the accuracy and consistency of 3D segmentation in virtual environments using interactive methods.
Developed a human-in-the-loop framework that combines optimization-based segmentation with a finger-based VR interface.
Implemented a 3D segmentation module that runs within a few seconds, tens of times faster than existing learning-based methods, so users see updated results as they adjust prompts and viewpoints.
Used precise 3D spatial prompts that are consistent across views, avoiding the limitations of traditional 2D multiview prompts.
Achieved significant improvements in segmentation accuracy and semantic consistency over appearance-driven methods based on multiview 2D images.
Demonstrated greater robustness to occlusion and to objects with multiple parts in the experimental scenarios.
Showed fine-grained subpart segmentation in cluttered scenes, supporting the framework's practical usability.

Abstract

3D Gaussian Splatting has recently emerged as a powerful representation for photorealistic rendering and reconstruction of complex scenes. However, its practical applications in augmented/virtual reality, digital twins, and robotics demand accurate, structurally consistent, and meaningful 3D segmentation, which remains a significant challenge. Existing 3D segmentation approaches, predominantly based on multiview 2D images, frequently rely on appearance‐driven criteria, resulting in semantic misclassification: they either incorrectly merge distinct object parts or excessively fragment coherent regions. Moreover, these methods struggle with multi‐component objects and occluded scenes. To address these limitations, we propose an interactive human‐in‐the‐loop segmentation framework that combines a fast optimization‐based 3D segmentation algorithm with intuitive finger‐based user interactions within a virtual reality environment. Our optimization‐based segmentation module runs within a few seconds (tens of times faster than existing learning‐based methods), providing users with real‐time visual updates on current segmentation results and enabling them to refine outputs interactively by adjusting prompts and viewpoints. Our finger‐based interface allows precise 3D spatial prompting, yielding accurate and multiview‐consistent prompts and thereby overcoming the limitations of traditional 2D multiview prompts and segmentation. This combination significantly improves segmentation accuracy, semantic consistency, and robustness to occlusion and multipart structures, as demonstrated by experimental results showing fine‐grained subpart segmentation in cluttered scenes.
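
The prompt-and-refine loop described in the abstract can be illustrated with a toy example. The sketch below is not the paper's optimization-based algorithm, which the abstract does not specify. It substitutes a simple nearest-prompt rule, and every name in it (`segment_by_prompts`, `radius`, the sample coordinates) is a hypothetical chosen for illustration. It only shows how 3D point prompts placed directly in space, such as fingertip positions, could select Gaussians by their centers, and how one added negative prompt refines the result on the next pass.

```python
import math


def nearest_distance(point, prompts):
    """Distance from point to the closest prompt, or infinity if there are none."""
    return min((math.dist(point, p) for p in prompts), default=math.inf)


def segment_by_prompts(centers, positive, negative, radius=0.5):
    """Return one boolean per Gaussian center (True = selected).

    Assumed rule, for illustration only: a center is selected when a positive
    prompt lies within `radius` and is closer than every negative prompt.
    """
    mask = []
    for center in centers:
        d_pos = nearest_distance(center, positive)
        d_neg = nearest_distance(center, negative)
        mask.append(d_pos <= radius and d_pos < d_neg)
    return mask


# Hypothetical Gaussian centers laid out along the x axis.
centers = [(0.0, 0.0, 0.0), (0.2, 0.0, 0.0), (0.4, 0.0, 0.0), (2.0, 0.0, 0.0)]

# Pass 1: the user places a single positive prompt in 3D.
positive = [(0.0, 0.0, 0.0)]
negative = []
first_pass = segment_by_prompts(centers, positive, negative)

# Pass 2: the user sees the third Gaussian was wrongly included and
# places a negative prompt next to it, then the segmentation is re-run.
negative.append((0.45, 0.0, 0.0))
refined = segment_by_prompts(centers, positive, negative)

print(first_pass)  # [True, True, True, False]
print(refined)     # [True, True, False, False]
```

Because the prompts live in the same 3D coordinates as the Gaussians, the same prompt set applies from every viewpoint, which is the property the abstract refers to as multiview consistency. A real system would replace the distance rule with the actual segmentation module.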
