What question did this study set out to answer?

The research aims to develop an effective end-to-end multi-agent reinforcement learning approach for particle tracking in detectors.

February 12, 2026Open Access

Constrained collaborative optimization of charged particle tracking with multi-agent reinforcement learning

Key Points

The research aims to develop an effective end-to-end multi-agent reinforcement learning approach for particle tracking in detectors.
Employs multi-agent reinforcement learning with assignment constraints
Optimizes a parameterized policy for reconstructing particle tracks
Utilizes a safety layer for solving linear assignment problems
Implements cost margins to enhance policy predictions
Demonstrated improved performance over single and multi-agent baselines
Achieved effective particle track reconstruction in simulated proton imaging
Showed enhanced optimization and generalization with constraints
Reduced predictive instability through structured cost margins

Abstract

Reinforcement learning (RL) demonstrated immense success in modeling complex physics-driven systems, providing end-to-end trainable solutions by interacting with a simulated or real environ- ment, maximizing a scalar reward signal. In this work, we propose, building upon previous work, an end-to-end multi-agent RL approach with assignment constraints for reconstructing particle tracks in pixelated particle detectors. Our approach optimizes collaboratively a parameterized policy, functioning as a heuristic to a multidimensional assignment problem, by jointly minimiz- ing the total amount of particle scattering over the reconstructed tracks in a readout frame. To sat- isfy constraints, guaranteeing a unique assignment of particle hits, we propose a safety layer solv- ing a linear assignment problem for every joint action. Further, to enforce cost margins, increas- ing the distance of the local policies predictions to the decision boundaries of the optimizer map- pings, we recommend the use of an additional component in the blackbox gradient estimation, forcing the policy to solutions with lower total assignment costs. We empirically show on simu- lated data, generated for a particle detector developed for proton imaging, the effectiveness of our approach, compared to multiple single- and multi-agent baselines. We further demonstrate the effectiveness of constraints with cost margins for both optimization and generalization, introduced by wider regions with high reconstruction performance as well as reduced predictive instabilities. Our results form the basis for further developments in RL-based tracking, offering both enhanced performance with constrained policies and greater flexibility in optimizing tracking algorithms through the option for individual and team rewards.

Bookmark

View Full Paper

Bookmark

View Full Paper

Constrained collaborative optimization of charged particle tracking with multi-agent reinforcement learning

Key Points

Abstract

Cite This Study