Abstract Legged robots have demonstrated remarkable potential for dynamic locomotion and terrain adaptability, making them a prominent focus of research. However, achieving robust and agile bipedal running remains challenging due to the complex dynamics of legged locomotion. In this paper, we propose a reinforcement learning framework for robust bipedal running, incorporating a simple reference trajectory generator and an asymmetric actor-critic architecture. The reference generator, based on kinematics, provides diverse trajectory references while preserving key gait characteristics, facilitating efficient policy exploration. To mitigate the simulation-to-reality gap, we extract latent variables encoding environmental and motion information from dual historical observations. Our method simplifies the trajectory generation process while maintaining effective guidance for learning. Extensive simulation and physical experiments demonstrate that, compared to model-based and learning-based baselines, our approach achieves higher agility, more accurate velocity tracking, and stronger disturbance rejection while preserving gait stability. The resulting controller exhibits spring–mass running dynamics that remain robust on both flat and uneven terrains.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yunpeng Liang
Zhihui Peng
Yanzheng Zhao
Robotica
Shanghai Jiao Tong University
Building similarity graph...
Analyzing shared references across papers
Loading...
Liang et al. (Tue,) studied this question.
www.synapsesocial.com/papers/6967193f87ba607552bb921e — DOI: https://doi.org/10.1017/s0263574725103007