Abstract In unstructured environments, Delta robots face challenges in achieving high vision-guided grasping precision due to dynamic lighting conditions and workpiece diversity. This paper designs an integrated solution that combines RGB-D multimodal learning with an enhanced Mask R-CNN framework. Initially, a dual-stream ResNet50-FPN backbone network is designed to achieve cross-modal adaptive alignment via hierarchical feature fusion. Subsequently, a depth-guided attention module is incorporated to bolster robustness against material ambiguity and reflective interference. Moreover, a dynamic depth estimation algorithm is employed to significantly improve target localization accuracy and stability. Finally, real-time trajectory tracking is realized by integrating PD control with Jacobian mapping. Experimental results validate the efficacy of the proposed method, offering an efficient and reliable approach for industrial robotic applications.
Building similarity graph...
Analyzing shared references across papers
Loading...
Puxuan Ning
Jiangping Mei
Jinlu Ni
Robotica
Tianjin University
Stevens Institute of Technology
Electric Power Research Institute
Building similarity graph...
Analyzing shared references across papers
Loading...
Ning et al. (Fri,) studied this question.
www.synapsesocial.com/papers/6992b3769b75e639e9b08382 — DOI: https://doi.org/10.1017/s0263574726103154