What does this research mean for the field?

A deep reinforcement learning-based optimization framework utilizing the Soft Actor-Critic algorithm enables efficient, fully automated preliminary design and multi-objective optimization of floating wind turbine substructures. Novelty: methodological. Consensus alignment: neutral.

What question did this study set out to answer?

This study aims to develop an intelligent framework for optimizing floating wind turbine substructure designs using deep reinforcement learning.

June 1, 2026 | Open Access

An intelligent framework for preliminary design optimization of floating wind substructures: reinforcement learning-based strategy and performance evaluation

Key points

Proposed an intelligent framework for optimizing floating wind turbine substructure designs using deep reinforcement learning.
Used the Soft Actor-Critic algorithm as the optimizer.
Coupled the agent to a simulation environment for design exploration.
Implemented geometry parameterization and hydrodynamic coefficient computation via potential flow theory.
Achieved efficient convergence in complex continuous design spaces.
Enabled fully automated design iterations with minimal human intervention.
Demonstrated improvements in design performance and adaptability.
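
The geometry parameterization step can be illustrated with a minimal sketch. Soft Actor-Critic implementations commonly emit actions squashed into [-1, 1], so one plausible approach is to rescale each action component linearly onto the bounds of a design variable. The parameter names and bounds below (`column_diameter_m`, `column_spacing_m`, `draft_m`) are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DesignBounds:
    """Lower and upper limit for one geometry parameter."""
    name: str
    low: float
    high: float


# Hypothetical parameters and ranges, chosen only for illustration.
BOUNDS = [
    DesignBounds("column_diameter_m", 8.0, 16.0),
    DesignBounds("column_spacing_m", 40.0, 80.0),
    DesignBounds("draft_m", 12.0, 30.0),
]


def action_to_design(action, bounds=BOUNDS):
    """Map a normalized action in [-1, 1]^n to physical design parameters."""
    if len(action) != len(bounds):
        raise ValueError("action length must match number of parameters")
    design = {}
    for a, b in zip(action, bounds):
        a = max(-1.0, min(1.0, a))  # clip to the valid action range
        # Linear rescale: -1 maps to b.low, +1 maps to b.high.
        design[b.name] = b.low + 0.5 * (a + 1.0) * (b.high - b.low)
    return design


if __name__ == "__main__":
    print(action_to_design([-1.0, 0.0, 1.0]))
```

Keeping the agent's action space normalized and doing the rescaling inside the environment means the bounds can be changed without retuning the policy's output layer.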

Abstract

Designing floating wind turbine (FWT) substructures involves navigating high-dimensional, nonlinear, and tightly constrained design spaces, where conventional manual or rule-based methods often struggle with scalability and efficiency. This study proposes a novel deep reinforcement learning (DRL)-based optimization framework that autonomously explores and optimizes preliminary substructure designs. Leveraging the Soft Actor-Critic (SAC) algorithm, the agent continuously interacts with a coupled simulation environment to identify high-performing configurations through trial-and-error learning, without explicit gradient information or exhaustive search. The environment integrates geometry parameterization, hydrodynamic coefficient computation via potential flow theory, and fully coupled dynamic response simulations using established tools such as Capytaine and OpenFAST. By interpreting platform performance indicators, including motions, structural responses, and stability metrics, as reward signals, the agent learns to balance trade-offs across conflicting objectives under physical constraints. Results demonstrate that the proposed approach achieves efficient convergence in complex continuous action spaces and enables fully automated design iterations with minimal human input. This work offers a promising direction for intelligent early-stage design of FWT foundations, especially in scenarios requiring adaptability, generalization, and multi-objective performance.
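
The abstract does not give the reward formula, so the following is only a sketch of one common way to turn performance indicators into a scalar reward: a weighted sum of indicators normalized by their allowable limits, plus a penalty for any limit that is exceeded. The function name `design_reward`, the indicator keys, the weights, and the penalty factor are all assumptions made for illustration.

```python
def design_reward(indicators, weights, limits, penalty=10.0):
    """Return a scalar reward from platform performance indicators.

    indicators: simulated values, e.g. peak pitch angle (hypothetical keys)
    weights:    relative importance of each indicator in the objective
    limits:     allowable value per indicator, used for normalization
    penalty:    multiplier applied to normalized constraint violations
    """
    # Weighted, normalized cost: lower responses give a higher reward.
    cost = sum(weights[k] * indicators[k] / limits[k] for k in weights)
    # Sum of how far each indicator exceeds its limit (zero if within it).
    violation = sum(max(0.0, indicators[k] / limits[k] - 1.0) for k in limits)
    return -cost - penalty * violation


if __name__ == "__main__":
    limits = {"pitch_deg": 10.0, "heave_m": 4.0}
    weights = {"pitch_deg": 0.5, "heave_m": 0.5}
    feasible = {"pitch_deg": 5.0, "heave_m": 2.0}
    infeasible = {"pitch_deg": 15.0, "heave_m": 2.0}
    print(design_reward(feasible, weights, limits))
    print(design_reward(infeasible, weights, limits))
```

In a full pipeline, the `indicators` dictionary would be filled from the coupled simulation outputs for each candidate design; a design that violates a limit receives a sharply lower reward than one that merely performs worse within the limits.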
