What question did this study set out to answer?

To optimize charging strategies for UAVs in wireless rechargeable sensor networks using IRS and DRL techniques.

January 25, 2026

DRL-Based Charging Strategy Optimization for IRS-Assisted UAV in Wireless Rechargeable Sensor Networks

Key Points

Developed a model with the IRS mounted beneath the UAV
Implemented a two-stage DRL algorithm named SCA-GMAPPO
Integrated Gated Recurrent Unit (GRU) in the MAPPO framework
Applied Successive Convex Approximation (SCA) for phase shift optimization
SCA-GMAPPO reduces computational time compared to using DRL alone to jointly optimize UAV trajectory and IRS phase shifts
Improved system energy efficiency and fairness among sensors
UAVs successfully navigate complex environments with obstacles

Abstract

In Wireless Rechargeable Sensor Networks, conventional long-range wireless power transfer technologies employed by mobile charging devices (e.g., Unmanned Aerial Vehicles (UAVs)) are inefficient. An Intelligent Reflecting Surface (IRS) can effectively address this issue by enhancing channel gain through the adjustment of phase shifts. However, in most existing studies, the placement of the IRS is constrained by spatial limitations. To address these problems, a model is formulated in which the IRS is mounted beneath the UAV, and a two-stage algorithm based on Deep Reinforcement Learning (DRL) named “SCA-GMAPPO” is proposed. Compared to approaches that use DRL alone to simultaneously optimize the UAV trajectory and the phase shifts of the IRS reflecting elements, the proposed approach significantly reduces computational time. First, a Gated Recurrent Unit (GRU) is integrated within the Multi-Agent Proximal Policy Optimization (MAPPO) framework to accurately capture the trajectory variations and charging duration. Second, the Successive Convex Approximation (SCA) algorithm is employed to optimize the phase shifts of the IRS reflecting elements, which reduces computational overhead and enhances the energy reception efficiency of sensor nodes. The experimental results demonstrate that SCA-GMAPPO outperforms existing mainstream DRL methods in terms of system energy efficiency and energy fairness. Furthermore, in complex environments with obstacles, UAVs are able to accurately find safe trajectories.
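
The abstract's claim that adjusting phase shifts enhances channel gain can be illustrated with a minimal sketch. It is not the paper's SCA procedure: it assumes a single sensor, a hypothetical cascaded UAV-to-IRS-to-sensor link with unit-modulus channel coefficients, and uses the simple closed-form phase alignment that applies in that special case. All names (`cascaded_gain`, `aligned_phases`, `random_channel`) are illustrative and do not come from the study.

```python
import cmath
import math
import random


def cascaded_gain(h, g, theta):
    """Power gain |sum_n h_n * exp(j*theta_n) * g_n|^2 of the reflected link."""
    total = sum(hn * cmath.exp(1j * tn) * gn for hn, gn, tn in zip(h, g, theta))
    return abs(total) ** 2


def aligned_phases(h, g):
    """Phase shifts that cancel the phase of each cascaded term h_n * g_n."""
    return [-cmath.phase(hn * gn) for hn, gn in zip(h, g)]


def random_channel(n, rng):
    """Hypothetical unit-modulus channel coefficients with random phases."""
    return [cmath.exp(1j * rng.uniform(0, 2 * math.pi)) for _ in range(n)]


rng = random.Random(0)
n_elements = 64  # assumed number of IRS reflecting elements
h = random_channel(n_elements, rng)  # assumed transmitter -> IRS coefficients
g = random_channel(n_elements, rng)  # assumed IRS -> sensor coefficients

# Baseline: phase shifts chosen at random, so reflected terms add incoherently.
random_theta = [rng.uniform(0, 2 * math.pi) for _ in range(n_elements)]
gain_random = cascaded_gain(h, g, random_theta)

# Aligned: every reflected term arrives in phase and adds coherently.
gain_aligned = cascaded_gain(h, g, aligned_phases(h, g))

print(f"random phases:  {gain_random:.1f}")
print(f"aligned phases: {gain_aligned:.1f}")
```

With unit-modulus coefficients, the aligned gain equals the square of the number of elements, whereas randomly chosen phases give a much smaller gain on average. With several sensors to charge, no single phase setting aligns every link at once, which is the kind of setting where an iterative method such as SCA is typically used instead.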

Cite This Study

Liu et al. studied this question.

synapsesocial.com/papers/6975b20efeba4585c2d6d7b3 https://doi.org/10.1145/3789206