What type of study is this?

This is a quantitative study.

September 29, 2025 · Open Access

Gait in Eight: Efficient On-Robot Learning for Omnidirectional Quadruped Locomotion

Key Points

Our framework demonstrates effective locomotion learning in just 8 minutes of real-time training.
Utilizing the off-policy algorithm CrossQ, we achieve high sample efficiency in quadruped training.
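We investigate two control architectures: joint target position prediction for agile, high-speed locomotion, and Central Pattern Generators for stable, natural gaits.
The framework extends on-robot learning from simple forward gaits to omnidirectional locomotion and is demonstrated in different indoor and outdoor environments.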

Abstract

On-robot Reinforcement Learning is a promising approach to train embodiment-aware policies for legged robots. However, the computational constraints of real-time learning on robots pose a significant challenge. We present a framework for efficiently learning quadruped locomotion in just 8 minutes of raw real-time training, utilizing the sample efficiency and minimal computational overhead of the new off-policy algorithm CrossQ. We investigate two control architectures: predicting joint target positions for agile, high-speed locomotion, and Central Pattern Generators for stable, natural gaits. While prior work focused on learning simple forward gaits, our framework extends on-robot learning to omnidirectional locomotion. We demonstrate the robustness of our approach in different indoor and outdoor environments.
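
To make the Central Pattern Generator idea concrete, here is a minimal sketch of an open-loop, phase-based gait generator. It is not the paper's implementation: the trot phase offsets, the half-sine swing profile, the function names, and all parameter values are assumptions chosen for illustration. It only shows how a shared clock with per-leg phase offsets yields a rhythmic, coordinated stepping pattern.

```python
import math

# Hypothetical trot offsets (fraction of a gait cycle) for legs FL, FR, RL, RR.
# Diagonal pairs share a phase; these values are illustrative assumptions.
TROT_OFFSETS = (0.0, 0.5, 0.5, 0.0)


def leg_phases(t, frequency_hz, offsets=TROT_OFFSETS):
    """Return each leg's phase in [0, 1) at time t (seconds)."""
    return [(frequency_hz * t + off) % 1.0 for off in offsets]


def foot_height(phase, swing_height=0.05, duty_factor=0.5):
    """Foot clearance in meters: 0 in stance, a half-sine bump in swing."""
    if phase < duty_factor:
        # Stance: the foot stays on the ground.
        return 0.0
    # Normalize progress through the swing portion of the cycle to [0, 1).
    swing = (phase - duty_factor) / (1.0 - duty_factor)
    return swing_height * math.sin(math.pi * swing)


def cpg_step(t, frequency_hz=2.0):
    """Return the four foot heights (FL, FR, RL, RR) at time t."""
    return [foot_height(p) for p in leg_phases(t, frequency_hz)]
```

In a learned setting, a policy could modulate quantities such as the frequency or swing height rather than output raw joint targets; which quantities the authors' policy actually controls is not stated in this summary.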
