What question did this study set out to answer?

The study aims to develop Gym-TORAX, a Python package for creating RL environments for tokamak plasma control.

March 2, 2026 · Open Access

Gym-TORAX: Open-source software for integrating reinforcement learning with plasma control simulators in tokamak research

Key Points

Gym-TORAX is an open-source Python package that exposes tokamak plasma control as Gymnasium environments for RL.
Each environment is defined by an action space, a state-observation space, and a reward function that measures plasma characteristics.
The TORAX plasma simulator computes the plasma states, which provides dynamic plasma modeling.
The formulation is compatible with most existing RL algorithms and libraries.
The current version provides one ready-to-use environment, based on an ITER scenario.
The package is intended to facilitate RL research and applications in plasma control.

Abstract

This paper presents Gym-TORAX, a Python package to define Reinforcement Learning (RL) environments for plasma control in tokamaks. Gym-TORAX instantiates a Gymnasium environment from an action space, a state-observation space, and a reward function that measures plasma characteristics. The environment computes plasma states using the TORAX plasma simulator and the objective is to maximize the expected sum of rewards. This plasma control formalization is compatible with most RL algorithms and libraries to facilitate RL research and applications. In its current version, one environment is readily available, based on an International Thermonuclear Experimental Reactor (ITER) scenario.

• Reinforcement learning environment in Gymnasium for cross-compatibility.
• Reinforcement learning for tokamak plasma control.
• TORAX simulations for dynamic plasma modeling.
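
The formalization in the abstract (an action space, an observation, a reward, and a return to maximize) can be illustrated with a small sketch. This is not the Gym-TORAX API, and it does not call TORAX. The class ToyPlasmaEnv, its scalar "temperature" state, the first-order dynamics, and the set-point reward are all hypothetical stand-ins chosen for illustration. The class only mirrors the Gymnasium calling convention, where reset returns (observation, info) and step returns (observation, reward, terminated, truncated, info), and it uses only the Python standard library so it runs without extra dependencies.

```python
import random


class ToyPlasmaEnv:
    """Toy stand-in that follows the Gymnasium reset/step conventions.

    Hypothetical: a scalar "temperature" relaxes toward the applied heating
    power. In Gym-TORAX the state update would come from TORAX instead.
    """

    def __init__(self, target=1.0, horizon=50):
        self.target = target      # hypothetical set point used by the reward
        self.horizon = horizon    # episode length in steps
        self.action_low, self.action_high = 0.0, 2.0  # action space bounds

    def reset(self, seed=None, options=None):
        # Start each episode from a random low temperature.
        self.rng = random.Random(seed)
        self.temperature = self.rng.uniform(0.0, 0.5)
        self.t = 0
        return self.temperature, {}

    def step(self, action):
        # Clip the action to the bounds of the action space.
        action = min(max(action, self.action_low), self.action_high)
        # Toy first-order dynamics (placeholder for the plasma simulator).
        self.temperature += 0.2 * (action - self.temperature)
        self.t += 1
        # Reward measures a plasma characteristic: distance to the set point.
        reward = -abs(self.temperature - self.target)
        terminated = False                   # no failure state in this toy
        truncated = self.t >= self.horizon   # time limit reached
        return self.temperature, reward, terminated, truncated, {}


def rollout(env, policy, seed=0):
    """Run one episode and return the sum of rewards (the episode return)."""
    obs, _ = env.reset(seed=seed)
    total, done = 0.0, False
    while not done:
        obs, reward, terminated, truncated, _ = env.step(policy(obs))
        total += reward
        done = terminated or truncated
    return total


# Compare two fixed policies by their episode return.
constant_return = rollout(ToyPlasmaEnv(), lambda obs: 1.0)
zero_return = rollout(ToyPlasmaEnv(), lambda obs: 0.0)
```

In this toy setting, the policy that always applies the set-point power earns a higher return than the policy that applies none. An RL algorithm searches for a policy that maximizes the expected value of this return.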
