What question did this study set out to answer?

The aim is to develop a robust handover system that can effectively operate under adverse lighting conditions in human-robot interactions.

April 3, 2026Open Access

Robust Human-to-Robot Handover System Under Adverse Lighting

Read Full Paperexternally

Key Points

The aim is to develop a robust handover system that can effectively operate under adverse lighting conditions in human-robot interactions.
Developed a dual-path perception pipeline using stereo RGB-D and ToF cameras.
Augmented Point Transformer v3 architecture with a T-Net module for improved spatial understanding.
Enhanced GraspNet for optimized grasp selection in H2R scenarios.
Conducted extensive experiments with semantic segmentation datasets and real-world handover trials.
Achieved 84.4% mIoU in semantic segmentation, outperforming the baseline by 3.26 percentage points with faster inference.
Improved handover success rate by 18.4 percentage points compared to the baseline across multiple objects and angles.
Obtained an overall success rate of 82.7% under controlled adverse lighting, surpassing single-camera baselines by up to 39.4 percentage points.
Achieved 75.0% success in comparison to a state-of-the-art multimodal method under the same lighting conditions.

Abstract

Human-to-robot (H2R) handovers are critical in human–robot interaction but are challenged by complex environments that impact robot perception. Traditional RGB-based perception methods exhibit severe performance degradation under harsh lighting (e.g., glare and darkness). Furthermore, H2R handovers occur in unstructured environments populated with fine-grained visual details, such as multi-angle hand configurations and novel object geometries, where conventional semantic segmentation and grasp generation approaches struggle to generalize. To overcome lighting disturbances, we present an H2R handover system with a dual-path perception pipeline. The system fuses perception data from a stereo RGB-D camera (eye-in-hand) and a time-of-flight (ToF) camera (fixed scene) under normal lighting, and switches to the ToF camera for reliable perception under glare and darkness. In parallel, to address the complex spatial and geometric features, we augment the Point Transformer v3 (PTv3) architecture by integrating a T-Net module and a self-attention mechanism to fuse the relative positional angle features between human and robot, enabling efficient real-time 3D semantic segmentation of both the object and the human hand. For grasp generation, we extend GraspNet with a grasp selection module optimized for H2R scenarios. We validate our approach through extensive experiments: (1) a semantic segmentation dataset with 7500 annotated point clouds covering 15 objects and 5 relative angles and tested on 750 point clouds from 15 unseen objects, where our method achieves 84.4% mIoU, outperforming Swin3D-L by 3.26 percentage points with 3.2× faster inference; (2) 250 real-world handover trials comparing our method with the baseline across 5 objects, 5 hand postures, and 5 angles, showing an improvement of 18.4 percentage points in success rate; (3) 450 trials under controlled adverse lighting (darkness and glare), where our dual-path perception method achieves 82.7% overall success, surpassing single-camera baselines by up to 39.4 percentage points; and (4) a comparative experiment against a state-of-the-art multimodal H2R handover method under identical adverse lighting, where our system achieves 75.0% success (15/20) versus the baseline’s 15.0% (3/20), further confirming the lighting robustness of our design. These results demonstrate the system’s robustness and generalization in challenging H2R handover scenarios.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Yifei Wang

Southeast University

Baoguo Xu

Huijun Li

Southeast University

Journals

Biomimetics

Actions

Institutions

Southeast University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Robust Human-to-Robot Handover System Under Adverse Lighting

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study