What question did this study set out to answer?

This research aims to maximize the energy output of wave energy converters using a deep reinforcement learning approach.

April 19, 2026Open Access

Deep reinforcement learning for energy maximization in AWS wave energy systems with supercapacitor storage unit

Puntos clave

This research aims to maximize the energy output of wave energy converters using a deep reinforcement learning approach.
Applied the deep deterministic policy gradient (DDPG) algorithm for energy control.
Integrated a supercapacitor storage unit within the wave energy converter architecture.
Utilized real-time data on wave excitation, floater velocity, and position.
Compared performance against traditional control methods under variable sea conditions.
DDPG algorithm improved energy yield by 20% over model-based control (MBC).
Achieved a 79% increase compared to feedback linearization control (FLC).
Outperformed approximate current control (ACC) by 100% and no control by 156%.
Confirmed superiority using real ocean data, excelling in 8 of 12 months evaluated.

Resumen

A deep deterministic policy gradient (DDPG)-based reinforcement learning (RL) controller is presented in this research to maximize the energy yield of the Archimedes Wave Swing (AWS) wave energy converter (WEC). This is the first application of the DDPG algorithm to the AWS WEC architecture integrated with a supercapacitor storage unit. Unlike discrete deep Q-learning (not suitable for continuous action) or on-policy methods such as soft actor-critic (SAC) and proximal policy optimization (PPO), which require higher computational resources. DDPG was chosen because it handles continuous action spaces, learns model-free optimal policies online, and adapts to irregular waves without requiring accurate excitation force prediction. By using real-time observations of wave excitation force, floater velocity, and position, a DDPG agent provides the value of quadrature axis current ( I q - r e f ) to control the generator damping across irregular wave conditions. The architecture incorporates a supercapacitor storage system coupled via a bidirectional DC-DC converter. This ensures that the AWS generator is supplied with reverse mechanical power whenever needed. The performance of the proposed method is benchmarked against established strategies such as approximate current control (ACC), feedback linearization control (FLC), and model-based control (MBC) under both simulated and real-world wave data. Simulations over 500 s demonstrate that under irregular sea states ( H s = 4 m and T p = 8 s ), the trained DDPG yields energy increases by 20% over MBC, 79% over FLC, 100% over ACC, and 156% over no control. Additional evaluations using real ocean data from Hawaii (NDBC Station 51003) confirm the DDPG agent’s superiority in 8 out of 12 months.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo

Cite This Study

Ali et al. (Fri,) studied this question.

synapsesocial.com/papers/69e4745f010ef96374d901c1 https://doi.org/https://doi.org/10.1016/j.asej.2026.104169

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Me gusta

Guardar

Ver artículo completo