What question did this study set out to answer?

To develop a framework that combines reinforcement learning and path optimization for effective dynamic obstacle avoidance in robotic arms.

February 9, 2026 · Open Access

Adaptive Obstacle Avoidance for Robotic Arms Using Hierarchical Reinforcement Learning and Path Optimization

Key Points

Developed a hierarchical framework that combines reinforcement learning and path optimization for dynamic obstacle avoidance in robotic arms.
Utilized proximal policy optimization (PPO) for global obstacle avoidance strategies.
Implemented rapidly exploring random tree star (RRT*) for refining local trajectories.
Adopted a curriculum learning approach to progressively train on more difficult scenarios.
Employed a multiobjective reward function with step-efficiency and potential field principles.
Achieved an 87.6% success rate in dynamic obstacle avoidance scenarios.
Outperformed standalone PPO and existing hybrid reinforcement learning approaches in success rate.
Demonstrated improved trajectory precision in unstructured environments.
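
The curriculum learning point above can be illustrated with a minimal sketch. The summary does not describe the actual stages or promotion rule, so everything here is an assumption: the `Stage` fields, the three example stages, the 0.8 promotion threshold, and the 100-episode window are hypothetical values chosen only to show how training could move to harder scenarios once the recent success rate is high enough.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """One difficulty level (all fields are hypothetical)."""
    name: str
    n_obstacles: int
    obstacle_speed: float  # assumed units: m/s


# Illustrative stages only; the paper's real scenarios are not given here.
DEFAULT_STAGES = (
    Stage("static", 1, 0.0),
    Stage("slow", 2, 0.05),
    Stage("fast", 4, 0.15),
)


class Curriculum:
    """Promote to the next stage when the recent success rate is high."""

    def __init__(self, stages=DEFAULT_STAGES, promote_at=0.8, window=100):
        self.stages = tuple(stages)
        self.promote_at = promote_at
        self.window = window
        self.index = 0
        self.results = deque(maxlen=window)  # rolling episode outcomes

    @property
    def stage(self):
        return self.stages[self.index]

    def record(self, success):
        """Log one episode outcome and return the stage for the next one."""
        self.results.append(bool(success))
        window_full = len(self.results) == self.window
        if window_full and self.index < len(self.stages) - 1:
            rate = sum(self.results) / self.window
            if rate >= self.promote_at:
                self.index += 1
                self.results.clear()  # start fresh statistics per stage
        return self.stage
```

In a training loop, the stage returned by `record` would configure the next episode's environment. Clearing the window on promotion keeps results from an easier stage from counting toward the next one.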

Abstract

Dynamic obstacle avoidance remains a key challenge in robotic arm motion planning, as traditional algorithms struggle to balance adaptive decision-making with precise trajectory generation in unstructured environments. We present a hierarchical motion planning framework that combines proximal policy optimization (PPO) with rapidly exploring random tree star (RRT*), trained using a curriculum learning paradigm. PPO learns global obstacle avoidance strategies through progressively difficult training scenarios, while RRT* refines local trajectories to compensate for PPO's limitations in fine motor control. A multiobjective reward function, incorporating step-efficiency terms and artificial potential field principles, balances exploration and exploitation through tailored penalties and rewards. In dynamic obstacle scenarios, the proposed method achieves an 87.6% success rate, outperforming standalone PPO and existing hybrid reinforcement learning approaches. This framework offers a practical solution for dynamic obstacle avoidance with broader applicability to high-dimensional autonomous manipulation tasks.
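
To make the reward design more concrete, the following is a minimal sketch of a per-step reward that combines a step-efficiency penalty with an artificial potential field term. It is not the paper's reward function, which this abstract does not specify. The function names, the gains `k_att` and `k_rep`, the influence radius `d0`, and all bonus and penalty magnitudes are assumptions for illustration, and obstacles are treated as points for simplicity.

```python
import math


def potential(pos, goal, obstacles, k_att=1.0, k_rep=0.5, d0=0.3):
    """Classic artificial potential: attractive to goal, repulsive near obstacles."""
    d_goal = math.dist(pos, goal)
    u = 0.5 * k_att * d_goal ** 2
    for obs in obstacles:
        d = math.dist(pos, obs)
        if d < d0:  # repulsion only acts inside the influence radius
            d = max(d, 1e-6)  # avoid division by zero
            u += 0.5 * k_rep * (1.0 / d - 1.0 / d0) ** 2
    return u


def step_reward(prev_pos, pos, goal, obstacles,
                step_penalty=0.01, goal_tol=0.05, collision_dist=0.02,
                goal_bonus=10.0, collision_penalty=10.0):
    """Return (reward, done) for one end-effector move (hypothetical values)."""
    # Reward the decrease in potential between consecutive steps.
    reward = potential(prev_pos, goal, obstacles) - potential(pos, goal, obstacles)
    # Step-efficiency term: every step costs a little, favouring short paths.
    reward -= step_penalty
    if any(math.dist(pos, obs) < collision_dist for obs in obstacles):
        return reward - collision_penalty, True
    if math.dist(pos, goal) < goal_tol:
        return reward + goal_bonus, True
    return reward, False
```

Using the difference in potential between steps, rather than the raw potential, rewards progress toward the goal and away from obstacles instead of rewarding the agent for remaining in a low-potential region.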
