February 25, 2026Open Access

Adaptive multi-mode locomotion for bipedal wheel-legged robots via sparse mixture-of-experts deep reinforcement learning

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

The bipedal wheel-legged robot combines the high energy efficiency of wheeled movement with the terrain adaptability of legged locomotion. However, achieving a smooth transition between these two heterogeneous motion modes within a unified control framework remains challenging. This study proposes a reinforcement learning control framework that integrates the Mixture of Experts (MoE) architecture. This approach employs a "divide and conquer" strategy by introducing a dynamic gating network and a Top-K sparse activation mechanism, which automatically allocates different motion modes to specific expert subnetworks, effectively decoupling conflicting gradients. Simulation results demonstrate that, compared to the single-network PPO method, the MoE-enhanced algorithm exhibits significant improvements in training stability and rewards. The learned policy successfully achieved smooth rolling on flat surfaces and transitioned to dynamic leg-lifting gaits when confronted with obstacles. In various test terrains, it showed a markedly higher success rate compared to the single-network PPO method.

Me gusta

Guardar

Ver artículo completo

Cite This Study

He et al. (Wed,) studied this question.

synapsesocial.com/papers/6a07a124d343c0cd6cc63e59 https://doi.org/https://doi.org/10.3389/frobt.2026.1788395

Me gusta

Guardar

Ver artículo completo