What question did this study set out to answer?

The aim is to explore explainability in ML-based jet taggers for quark-gluon identification.

March 18, 2026Open Access

The physics behind ML-based quark-gluon taggers

Key Points

The aim is to explore explainability in ML-based jet taggers for quark-gluon identification.
Identified leading latent features related to physics observables.
Applied Shapley values to evaluate feature importance.
Utilized symbolic regression to create compact formulas for tagger output.
Established correlations between latent features and physics observables.
Showed limitations of Shapley values due to input correlations.
Derived functional approximations for the tagger output using symbolic regression.

Abstract

Jet taggers provide an ideal testbed for applying explainability techniques to powerful ML tools. For theoretically and experimentally challenging quark-gluon tagging, we first identify the leading latent features that correlate strongly with physics observables, both in a linear and a non-linear approach. Next, we show how Shapley values can assess feature importance, although the standard implementation assumes independent inputs and can lead to distorted attributions in the presence of correlations. Finally, we use symbolic regression to derive compact formulas to approximate the tagger output.

Bookmark

View Full Paper

Cite This Study

Vent et al. (Mon,) studied this question.

synapsesocial.com/papers/69ba43584e9516ffd37a4839 https://doi.org/https://doi.org/10.21468/scipostphys.20.3.084

Bookmark

View Full Paper