Fast gradient-descent methods for temporal-difference learning with linear function approximation | Synapse