4.4 Article

Efficient reinforcement learning of a reservoir network model of parametric working memory achieved with a cluster population winner-take-all readout mechanism

Journal

JOURNAL OF NEUROPHYSIOLOGY
Volume 114, Issue 6, Pages 3296-3305

Publisher

AMER PHYSIOLOGICAL SOC
DOI: 10.1152/jn.00378.2015

Keywords

working memory; reservoir network; reinforcement learning

Funding

  1. Strategic Priority Research Program of the Chinese Academy of Science Grant [XDB02050500]
  2. CAS Hundreds of Talents Program
  3. Shanghai Pujiang Program
  4. National Science Foundation of China (NSFC) [91420106, 90820305, 60775040, 61005085]
  5. Natural Science Foundation of Zhe Jiang (ZJNSF) [Y2111013]

Ask authors/readers for more resources

The brain often has to make decisions based on information stored in working memory, but the neural circuitry underlying working memory is not fully understood. Many theoretical efforts have been focused on modeling the persistent delay period activity in the prefrontal areas that is believed to represent working memory. Recent experiments reveal that the delay period activity in the prefrontal cortex is neither static nor homogeneous as previously assumed. Models based on reservoir networks have been proposed to model such a dynamical activity pattern. The connections between neurons within a reservoir are random and do not require explicit tuning. Information storage does not depend on the stable states of the network. However, it is not clear how the encoded information can be retrieved for decision making with a biologically realistic algorithm. We therefore built a reservoir-based neural network to model the neuronal responses of the prefrontal cortex in a somatosensory delayed discrimination task. We first illustrate that the neurons in the reservoir exhibit a heterogeneous and dynamical delay period activity observed in previous experiments. Then we show that a cluster population circuit decodes the information from the reservoir with a winner-take-all mechanism and contributes to the decision making. Finally, we show that the model achieves a good performance rapidly by shaping only the readout with reinforcement learning. Our model reproduces important features of previous behavior and neurophysiology data. We illustrate for the first time how task-specific information stored in a reservoir network can be retrieved with a biologically plausible reinforcement learning training scheme.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available