Key points are not available for this paper at this time.
In this paper, we propose a deep reinforcement learning (DRL)-based method that allows unmanned aerial vehicles (UAVs) to execute navigation tasks in large-scale complex environments. This technique is important for many applications such as goods delivery and remote surveillance. The problem is formulated as a partially observable Markov decision process (POMDP) and solved by a novel online DRL algorithm designed based on two strictly proved policy gradient theorems within the actor-critic framework. In contrast to conventional simultaneous localization and mapping-based or sensing and avoidance-based approaches, our method directly maps UAVs' raw sensory measurements into control signals for navigation. Experiment results demonstrate that our method can enable UAVs to autonomously perform navigation in a virtual large-scale complex environment and can be generalized to more complex, larger-scale, and three-dimensional environments. Besides, the proposed online DRL algorithm addressing POMDPs outperforms the state-of-the-art.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chao Wang
Jian Wang
Yuan Shen
IEEE Transactions on Vehicular Technology
Tsinghua University
Building similarity graph...
Analyzing shared references across papers
Loading...
Wang et al. (Thu,) studied this question.
www.synapsesocial.com/papers/6a0ef57a2eca052da647f5e3 — DOI: https://doi.org/10.1109/tvt.2018.2890773
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: