Towards Q-learning the Whittle Index for Restless Bandits | Synapse