What question did this study set out to answer?

The aim is to optimize metamaterial absorbers and polarization converters using an advanced reinforcement learning algorithm.

April 21, 2026 | Open Access

Optimization of broadband metamaterial absorber using twin delayed deep deterministic policy gradient reinforcement learning technique

Key Points

The aim is to optimize metamaterial absorbers and polarization converters using an advanced reinforcement learning algorithm.
Used the TD3 algorithm for optimization without requiring gradient information or pre-built surrogate models.
Optimized the geometric parameters of an existing metamaterial absorber based on an L-shaped resonator.
Fabricated a polarization converter based on the optimized design.
Achieved absorption above 90% for the metamaterial absorber from 12.2 GHz to 22.4 GHz.
The fabricated polarization converter demonstrated a conversion ratio above 90% from 11.8 GHz to 24.2 GHz.
Maintained a conversion ratio above 80% at oblique incidence angles up to 50° over most of the frequency range.
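
For readers unfamiliar with TD3, the following is a minimal sketch of its characteristic learning target: the smaller of two target critics' estimates, evaluated at a target action perturbed by clipped noise. It illustrates the general TD3 algorithm only, not the study's implementation. The function name `td3_target`, the callables passed in, and all default hyperparameters are hypothetical placeholders; the summary above does not report the study's network design, reward, or settings.

```python
import numpy as np

def td3_target(reward, next_state, done, target_actor, target_q1, target_q2,
               rng, gamma=0.99, noise_std=0.2, noise_clip=0.5,
               action_low=-1.0, action_high=1.0):
    """Clipped double-Q target with target policy smoothing (generic TD3).

    All arguments are illustrative assumptions: `target_actor` maps states to
    actions, and `target_q1` / `target_q2` map (state, action) to one value
    per batch row.
    """
    # Target policy smoothing: perturb the target action with clipped noise.
    action = target_actor(next_state)
    noise = np.clip(rng.normal(0.0, noise_std, size=action.shape),
                    -noise_clip, noise_clip)
    smoothed = np.clip(action + noise, action_low, action_high)

    # Clipped double-Q: take the more pessimistic of the two critic estimates.
    q_min = np.minimum(target_q1(next_state, smoothed),
                       target_q2(next_state, smoothed))

    # No bootstrapping past terminal transitions (done == 1).
    return reward + gamma * (1.0 - done) * q_min
```

In a design-optimization setting, the state and action would plausibly encode geometric parameters and their adjustments, with the reward derived from simulated absorption or conversion performance; that mapping is an assumption here, since the summary does not describe it.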

Abstract

This paper presents a new reinforcement learning (RL)-driven inverse design strategy that leverages the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for the efficient optimization of photonic structures, with a focus on metamaterial absorbers (MAs) and cross-polarization converters (CPCs) as demonstrative applications. Unlike conventional heuristic or surrogate-based optimization methods, the proposed RL approach autonomously learns the optimal geometric configuration through direct interaction with the simulation environment, without requiring gradient information or pre-built surrogate models. Initially, the TD3 model is used to optimize the geometric parameters of an existing MA based on an L-shaped resonator, raising its absorption above 90% in the frequency range from 12.2 GHz to 22.4 GHz in only 23 iterations. Then, a novel CPC design is proposed, optimized using the same RL framework, and subsequently fabricated. The fabricated structure achieves a high polarization conversion ratio (PCR) above 90% over a wide frequency range from 11.8 GHz to 24.2 GHz, covering the full Ku band and most of the K band. Furthermore, over most of the frequency range, the converter maintains strong performance under oblique incidence, with PCR levels above 80% up to an angle of 50°. These results validate the effectiveness of the TD3-based RL framework in discovering high-performance and fabrication-ready designs, while also establishing a scalable and generalizable optimization paradigm for advanced photonic devices.
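
To put the reported bands in perspective, the short sketch below computes the fractional bandwidth implied by the band edges quoted in the abstract, together with the commonly used definition of PCR for a reflective converter. The helper names `pcr` and `fractional_bandwidth` are illustrative, and the reflective-converter PCR formula is an assumption about the convention used; the abstract itself does not state the formula.

```python
import numpy as np

def pcr(r_cross, r_co):
    """Polarization conversion ratio from cross- and co-polarized
    reflection coefficients (common reflective-converter convention)."""
    p_cross = np.abs(r_cross) ** 2
    p_co = np.abs(r_co) ** 2
    return p_cross / (p_cross + p_co)

def fractional_bandwidth(f_low, f_high):
    """Bandwidth divided by the center frequency (same units for both)."""
    return (f_high - f_low) / ((f_high + f_low) / 2.0)

# Band edges in GHz, taken from the abstract.
absorber_fbw = fractional_bandwidth(12.2, 22.4)   # roughly 0.59
converter_fbw = fractional_bandwidth(11.8, 24.2)  # roughly 0.69
```

Under these definitions, the absorber band corresponds to a fractional bandwidth of about 59% and the converter band to about 69%.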

Cite This Study

Mahmoud et al. studied this question.

synapsesocial.com/papers/69e7138bcb99343efc98cffa https://doi.org/10.1038/s41598-026-41716-8
