Non-stationary Bandits with Heavy Tail

Key Points

Key points are not available for this paper at this time.

Abstract

In this study, we investigate the performance of multi-armed bandit algorithms in environments characterized by heavytailed and non-stationary reward distributions, a setting that deviates from the conventional risk-neutral and sub- Gaussian assumptions.

Bookmark

Cite This Study

Pan et al. (Thu,) studied this question.

synapsesocial.com/papers/68e59556b6db643587530053 https://doi.org/https://doi.org/10.1145/3695411.3695424

Bookmark