What question did this study set out to answer?

This study aims to systematically evaluate deep reinforcement learning applications in legged robot locomotion.

February 2, 2026Open Access

A Systematic Review of Deep Reinforcement Learning for Legged Robot Locomotion

Key Points

This study aims to systematically evaluate deep reinforcement learning applications in legged robot locomotion.
Reviewed 27 peer-reviewed studies from 2018 to 2025
Analyzed model-free, model-based, hierarchical, and hybrid DRL frameworks
Focused on experimental platforms and benchmarking practices based on PRISMA guidelines
Found that reward shaping and policy representation significantly impact control performance
Highlighted the importance of domain randomization for real-world application
Identified energy management and safety as essential for effective algorithm deployment

Abstract

Legged robot locomotion remains a critical challenge in robotics, demanding control strategies that are not only dynamically stable and robust but also capable of adapting to complex and changing environments. deep reinforcement learning (DRL) has recently emerged as a powerful approach to automatically generate motion control policies by learning from interactions with simulated or real environments. This study provides a systematic overview of DRL applications in legged robot control, emphasizing experimental platforms, measurement techniques, and benchmarking practices. Following PRISMA guidelines, 27 peer-reviewed studies published between 2018 and 2025 were analyzed, covering model-free, model-based, hierarchical, and hybrid DRL frameworks. Our findings reveal that reward shaping, policy representation, and training stability significantly influence control performance, while domain randomization and dynamic adaptation methods are essential for bridging the simulation-to-real-world gap. In addition, this review highlights instrumentation approaches for evaluating algorithm effectiveness, offering insights into sample efficiency, energy management, and safe deployment. The results aim to guide the development of reproducible and experimentally validated DRL-based control systems for legged robots.

A Systematic Review of Deep Reinforcement Learning for Legged Robot Locomotion

Key Points

Abstract

Cite This Study

Also Consider

Also Consider