What type of study is this?

September 10, 2025Open Access

Enhanced Q learning and deep reinforcement learning for unmanned combat intelligence planning in adversarial environments

Key Points

The MDRL-DQN algorithm improves task success rates to 89.6% in dispersed defense and 94.8% in concentrated defense scenarios.
In Scenario 1, the algorithm completes tasks in 720.8 seconds, which is 16.7% faster than Proximal Policy Optimization (PPO).
Using a multimodal approach, the algorithm effectively processes image and sensor data for improved task execution.
These findings suggest enhanced efficiency and stability in unmanned combat operations through advanced planning algorithms.

Abstract

This study proposes a multimodal deep reinforcement learning (MDRL) architecture, Multimodal Deep Reinforcement Learning-Deep Q-Network (MDRL-DQN), based on an improved Q-Learning algorithm. It aims to optimize Unmanned Aerial Vehicle (UAV) scheduling and execution capabilities in intelligent unmanned combat planning. By integrating an attention mechanism and an adaptive reward mechanism, the algorithm effectively fuses image data, sensor data, and intelligent information, enabling collaborative multimodal data processing. This improves task success rates, execution efficiency, and UAV deployment stability. Experimental results show that the improved MDRL-DQN algorithm demonstrates significant advantages in complex task scenarios. Specifically, in the long-distance dispersed defense (Scenario 1) and long-distance concentrated defense (Scenario 3), the task success rates reach 89.6% and 94.8%, respectively, outperforming other algorithms by several percentage points. Additionally, in Scenario 1, MDRL-DQN completes tasks in 720.8 s, which is 16.7% faster than Proximal Policy Optimization (PPO) at 865.3 s, highlighting its superior execution efficiency. These results indicate that the improved Q-Learning algorithm effectively enhances the efficiency and stability of unmanned combat tasks, providing new insights for intelligent planning in future unmanned operations.

Enhanced Q learning and deep reinforcement learning for unmanned combat intelligence planning in adversarial environments

Key Points

Abstract

Cite This Study

Also Consider

Also Consider