Estimation of returns over time, the focus of temporal difference (TD) algorithms, imposes particular constraints on good function approximators or representations. Appropriate generalization between states is determined by how similar their successors are, and representations should follow suit. This paper shows how TD machinery can be used to learn such representations, and illustrates, using a navigation task, the appropriately distributed nature of the result.
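As a rough illustration of the idea in the abstract, the sketch below uses a TD(0)-style update to learn, for each state, the expected discounted number of future visits to every other state. Rows of the resulting table can then serve as a representation in which states with similar successors look similar. This is a minimal tabular sketch under stated assumptions, not the paper's implementation: the function name `learn_successor_matrix`, the 4-state ring, and the values of `gamma`, `alpha`, and `sweeps` are all hypothetical choices made for this example.

```python
def learn_successor_matrix(transitions, n_states, gamma=0.9, alpha=0.5, sweeps=500):
    """Estimate expected discounted future state occupancies with TD(0)-style updates.

    transitions: list of (state, next_state) pairs observed under a fixed policy.
    Returns an n_states x n_states table M, where M[s][j] approximates the
    discounted number of future visits to j when starting from s.
    """
    M = [[0.0] * n_states for _ in range(n_states)]
    for _ in range(sweeps):
        for s, s_next in transitions:
            for j in range(n_states):
                # Target: 1 if we are in j right now, plus the discounted
                # prediction already held for the successor state.
                indicator = 1.0 if j == s else 0.0
                target = indicator + gamma * M[s_next][j]
                # Move the current estimate a fraction alpha toward the target.
                M[s][j] += alpha * (target - M[s][j])
    return M


# Hypothetical 4-state ring: 0 -> 1 -> 2 -> 3 -> 0 (deterministic transitions).
ring = [(0, 1), (1, 2), (2, 3), (3, 0)]
M = learn_successor_matrix(ring, n_states=4)

for row in M:
    print([round(x, 3) for x in row])
```

Under these assumptions, a value estimate for state `s` given a per-state reward vector `r` is the dot product of row `M[s]` with `r`, which is one way to see why generalization between states follows the similarity of their successors.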
Peter Dayan
Neural Computation
Salk Institute for Biological Studies
Peter Dayan studied this question.
www.synapsesocial.com/papers/6a0916f957846b5001d3a28a — DOI: https://doi.org/10.1162/neco.1993.5.4.613