Adaptive Order Q-learning | Synapse