Bandit Algorithms for Policy Learning: Methods, Implementation, and Welfare-performance | Synapse