On the Gittins Index for Multiarmed Bandits

Key Points

Key points are not available for this paper at this time.

Abstract

This paper considers the multiarmed bandit problem and presents a new proof of the optimality of the Gittins index policy. The proof is intuitive and does not require an interchange argument. The insight it affords is used to give a streamlined summary of previous research and to prove a new result: The optimal value function is a submodular set function of the available projects.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Richard Weber

University of Cambridge

Journals

The Annals of Applied Probability

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

On the Gittins Index for Multiarmed Bandits

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study