What question did this study set out to answer?

The aim is to develop a DRL strategy for efficient maximum power point tracking in photovoltaic systems under diverse conditions.

March 26, 2026Open Access

Maximum power point tracking of solar - pv under partial shading and variable load resistance using deep reinforcement learning algorithm

Key Points

The aim is to develop a DRL strategy for efficient maximum power point tracking in photovoltaic systems under diverse conditions.
Implemented a deep reinforcement learning framework using a Deep Deterministic Policy Gradient controller.
Formulated MPPT as a physically constrained control problem for a PV boost converter.
Developed a custom reward function to maximize power extraction and minimize abrupt duty cycle changes.
Trained the DDPG agent with various irradiance profiles and load conditions.
Benchmarked performance against conventional MPPT strategies.
Achieved smooth convergence with the lowest voltage root mean square error under uniform irradiance.
Tracked the global maximum power point under partial shading, delivering up to 10% higher power output.
Maintained voltage ripple below 0.3% compared to conventional PID control.
Demonstrated strong stability and robustness under high-load resistance and distorted irradiance.

Abstract

This study develops a deep reinforcement learning (DRL) maximum power point tracking (MPPT) strategy for photovoltaic (PV) systems operating under uniform irradiance and partial shading conditions. A Deep Deterministic Policy Gradient (DDPG) controller is designed to directly regulate the duty cycle of a PV boost converter, enabling continuous, model-free tracking of the global maximum power point (GMPP). Unlike conventional MPPT techniques that rely on predefined perturbation rules or averaged system models, the proposed approach formulates MPPT as a physically constrained control problem at the switching-converter level. A custom reward function is formulated to simultaneously maximize power extraction and penalize abrupt duty-cycle variations, thereby improving converter safety and operational robustness. The DDPG agent is trained using diverse curved and distorted irradiance profiles and variable load conditions to enhance policy generalization. Performance is benchmarked against classical P&O–PID control and advanced nonlinear intelligent MPPT strategies as well as real life case study. Under uniform irradiance, the proposed controller achieves smooth convergence with the lowest voltage root mean square error, while under partial shading it consistently tracks the GMPP, delivering up to 10% higher power output with ripple below 0.3% compared to conventional PID control. The DDPG–DRL controller demonstrates superior stability and robustness under a combination of distorted irradiance and high-load resistance scenarios, maintaining ripple below 0.2% and voltage operation closer to theoretical optimal points. These results confirm the effectiveness and practical applicability of the proposed DDPG–DRL MPPT framework for real-world photovoltaic energy systems.

Bookmark

View Full Paper

Bookmark

View Full Paper

Maximum power point tracking of solar - pv under partial shading and variable load resistance using deep reinforcement learning algorithm

Key Points

Abstract

Cite This Study