What question did this study set out to answer?

The aim is to develop an effective landing control system for lunar quadruped robots to address various operational challenges.

April 11, 2026Open Access

Reinforcement Learning-Based Landing Impact Mitigation and Stabilization Control for Lunar Quadruped Robots Under Complex Operating Conditions

Key Points

The aim is to develop an effective landing control system for lunar quadruped robots to address various operational challenges.
Developed a reinforcement learning-based landing controller.
Introduced a phase-structured formulation for impact management and stabilization.
Utilized a terrain-agnostic control representation for varying conditions.
Established a hybrid control model incorporating variable mass and impact constraints.
Conducted simulations and experimental validations under diverse landing conditions.
Achieved robust landing buffering and stability control across various masses and slopes.
Demonstrated less than 30% deviation between simulation and experimental results.
Showed favorable robustness against parameter variations during testing.

Abstract

Lunar quadruped robots face landing challenges including weak gravity, large mass variations, uncertain sloped terrain, and strict payload acceleration limits, requiring effective impact mitigation and rapid post-landing stabilization. This paper presents a novel end-to-end reinforcement learning-based landing controller with three key novelties: (i) a phase-structured yet implicitly encoded formulation that distinguishes contact preparation, energy dissipation, and stabilization without explicit phase switching; (ii) a terrain-agnostic state and control representation using equivalent support direction construction and contact-gated modulation to decouple normal–tangential dynamics; and (iii) an extremum oriented learning strategy that directly captures peak impact suppression and buffering sufficiency, addressing limitations of cumulative rewards in hybrid, peak-dominated tasks. A hybrid control model for lunar quadruped landing dynamics is established, incorporating variable mass, low impact, and full stroke as key constraints during training. Simulation and full-scale experimental prototypes are built to validate the controller. Simulation results demonstrate robust landing buffering and stability control under varying mass, landing velocity, and slope conditions, with favorable robustness against parameter variations. Experimental verification is conducted under diverse conditions including different masses (200 kg, 250 kg), vertical/horizontal landing velocities (0.8 m/s, 0.2 m/s), and slopes (0, 8). The deviation between simulation and experimental results does not exceed 30%, confirming the effectiveness and transferability of the proposed approach.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Li et al. (Thu,) studied this question.

synapsesocial.com/papers/69d9e4d578050d08c1b75358 https://doi.org/https://doi.org/10.3390/machines14040417

Bookmark

View Full Paper