What question did this study set out to answer?

The research aims to enhance drone navigation by learning an optimal reward function from expert demonstrations using inverse reinforcement learning.

February 19, 2026Open Access

Learning unknown reward function for drone navigation based on inverse deep reinforcement learning

Key Points

The research aims to enhance drone navigation by learning an optimal reward function from expert demonstrations using inverse reinforcement learning.
Employs adversarial inverse reinforcement learning (AIRL) to extract reward functions from expert demonstrations.
Compares the learned reward policy against human-designed rewards and baseline functions.
Evaluates the performance based on success rates, smoothness, and trajectory consistency.
The learned reward policy significantly improves the success rate of autonomous flight.
Comparison shows enhanced smoothness and consistency in drone trajectories.
These findings highlight the effectiveness of learning rewards from demonstrations over manual engineering.

Abstract

Abstract Autonomous drone navigation with deep reinforcement learning (DRL) is hindered by the difficulty of specifying reward functions for vision-based, continuous control in complex environments. We address this by using inverse reinforcement learning (IRL) to recover a task-aligned reward directly from expert demonstrations; specifically, we employ adversarial IRL (AIRL) to learn the reward. In evaluation, the learned-reward policy improves success rate, smoothness and trajectories consistency compared with a carefully tuned human-designed reward and baselines reward function. These results indicate that learning the reward from demonstrations provides a precise and transferable objective for autonomous flight, achieving better performance and better guidance under verification of our evaluation protocol without manual reward engineering. To the best of our knowledge, this is the first work to successfully apply an AIRL framework for visual drone navigation.

Mark Helpful

Bookmark

Relay

View Full Paper