Using confidence bounds for exploitation-exploration trade-offs | Synapse