Learning to Allocate Reusable Resources under Stochastic Rewards and Durations | Synapse