Los puntos clave no están disponibles para este artículo en este momento.
With the development of information technology, the degree of intelligence in air confrontation is increasing, and the demand for automated intelligent decision-making systems is becoming more intense. Based on the characteristics of over-the-horizon air confrontation, this paper constructs a super-horizon air confrontation training environment, which includes aircraft model modeling, air confrontation scene design, enemy aircraft strategy design, and reward and punishment signal design. In order to improve the efficiency of the reinforcement learning algorithm for the exploration of strategy space, this paper proposes a heuristic Q-Network method that integrates expert experience, and uses expert experience as a heuristic signal to guide the search process. At the same time, heuristic exploration and random exploration are combined. Aiming at the over-the-horizon air confrontation maneuver decision problem, the heuristic Q-Network method is adopted to train the neural network model in the over-the-horizon air confrontation training environment. Through continuous interaction with the environment, self-learning of the air confrontation maneuver strategy is realized. The efficiency of the heuristic Q-Network method and effectiveness of the air confrontation maneuver strategy are verified by simulation experiments.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhang et al. (Sat,) studied this question.
synapsesocial.com/papers/6a1ff8b11517a826fb04c2df — DOI: https://doi.org/10.3390/electronics7110279
Xianbing Zhang
China University of Geosciences (Beijing)
Guoqing Liu
Weifang Medical University
Chaojie Yang
North China Electric Power University
Electronics
Beihang University
Building similarity graph...
Analyzing shared references across papers
Loading...