Key points are not available for this paper at this time.
Autonomous umanned aerial vehicle (UAV) manipulation is necessary for the defense department to execute tactical missions given by commanders in the future unmanned battlefield. A large amount of research has been devoted to improving the autonomous decision-making ability of UAV in an interactive environment, where finding the optimal maneuvering decision-making policy became one of the key issues for enabling the intelligence of UAV. In this paper, we propose a maneuvering decision-making algorithm for autonomous air-delivery based on deep reinforcement learning under the guidance of expert experience. Specifically, we refine the guidance towards area and guidance towards specific point tasks for the air-delivery process based on the traditional air-to-surface fire control methods. Moreover, we construct the UAV maneuvering decision-making model based on Markov decision processes (MDPs). Specifically, we present a reward shaping method for the guidance towards area and guidance towards specific point tasks using potential-based function and expert-guided advice. The proposed algorithm could accelerate the convergence of the maneuvering decision-making policy and increase the stability of the policy in terms of the output during the later stage of training process. The effectiveness of the proposed maneuvering decision-making policy is illustrated by the curves of training parameters and extensive experimental results for testing the trained policy.
Building similarity graph...
Analyzing shared references across papers
Loading...
Guang Zhan
Kun Zhang
Ke Li
Journal of Systems Engineering and Electronics
Northwestern Polytechnical University
AVIC Optronics (China)
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhan et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68e6df92b6db64358765b2c0 — DOI: https://doi.org/10.23919/jsee.2024.000022