Value-guided decisions are a cornerstone of cognition, yet the underlying circuit-level mechanisms remain elusive. We used reinforcement learning to train recurrent neural network models endowed with Dale's law on a battery of economic choice tasks, which revealed a two-stage computational framework. First, value estimation occurs at the input level, where learned weights store subjective preferences and approximate the non-linear multiplication of reward magnitude and probability to yield expected values. This feedforward mechanism enables generalization to novel choice options. Second, option values are compared within the recurrent network, where specific connectivity patterns mediate robust winner-take-all decisions, with both excitatory and inhibitory neurons exhibiting value and choice selectivity. By training a single network on multiple tasks, we show compositional representations combining a shared computational schema with specialized neural modules. Reproducing key neurophysiological findings from the primate orbitofrontal cortex, our model unifies value computation, comparison, and generalization into a coherent framework with testable predictions.
Building similarity graph...
Analyzing shared references across papers
Loading...
Aldo Battista
Camillo Padoa‐Schioppa
Xiao-Jing Wang
Neuron
Washington University in St. Louis
New York University
Building similarity graph...
Analyzing shared references across papers
Loading...
Battista et al. (Sun,) studied this question.
www.synapsesocial.com/papers/6988270a0fc35cd7a8845f2a — DOI: https://doi.org/10.1016/j.neuron.2025.12.010