Key points are not available for this paper at this time.
Compared with reward seeking, punishment avoidance learning is less clearly understood at both the computational and neurobiological levels. Here we demonstrate, using computational modelling and fMRI in humans, that learning option values in a relative--context-dependent--scale offers a simple computational solution for avoidance learning. The context (or state) value sets the reference point to which an outcome should be compared before updating the option value. Consequently, in contexts with an overall negative expected value, successful punishment avoidance acquires a positive value, thus reinforcing the response. As revealed by post-learning assessment of options values, contextual influences are enhanced when subjects are informed about the result of the forgone alternative (counterfactual information). This is mirrored at the neural level by a shift in negative outcome encoding from the anterior insula to the ventral striatum, suggesting that value contextualization also limits the need to mobilize an opponent punishment learning system.
Building similarity graph...
Analyzing shared references across papers
Loading...
Stefano Palminteri
Inserm
Mehdi Khamassi
Centre National de la Recherche Scientifique
Mateus Joffily
Groupe d'Analyse et de Théorie Economique Lyon St Etienne
Nature Communications
Centre National de la Recherche Scientifique
University College London
Inserm
Building similarity graph...
Analyzing shared references across papers
Loading...
Palminteri et al. (Tue,) studied this question.
synapsesocial.com/papers/69db294a4a1e15904c83719c — DOI: https://doi.org/10.1038/ncomms9096