Article

Reinforcement learning with modulated spike timing-dependent synaptic plasticity

JOURNAL OF NEUROPHYSIOLOGY (2007)

Journal

JOURNAL OF NEUROPHYSIOLOGY

Volume 98, Issue 6, Pages 3648-3665

Publisher

AMER PHYSIOLOGICAL SOC

DOI: 10.1152/jn.00364.2007

Categories

Neurosciences; Physiology

Abstract

Spike timing-dependent synaptic plasticity (STDP) has emerged as the preferred framework linking patterns of pre- and postsynaptic activity to changes in synaptic strength. Although synaptic plasticity is widely believed to be a major component of learning, it is unclear how STDP itself could serve as a mechanism for general purpose learning. On the other hand, algorithms for reinforcement learning work on a wide variety of problems, but lack an experimentally established neural implementation. Here, we combine these paradigms in a novel model in which a modified version of STDP achieves reinforcement learning. We build this model in stages, identifying a minimal set of conditions needed to make it work. Using a performance-modulated modification of STDP in a two-layer feedforward network, we can train output neurons to generate arbitrarily selected spike trains or population responses. Furthermore, a given network can learn distinct responses to several different input patterns. We also describe in detail how this model might be implemented biologically. Thus our model offers a novel and biologically plausible implementation of reinforcement learning that is capable of training a neural population to produce a very wide range of possible mappings between synaptic input and spiking output.
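
The idea of a performance-modulated STDP rule can be illustrated with a minimal Python sketch. This is a generic illustration, not the authors' model: the exponential window shape, the all-to-all spike pairing, the `performance - baseline` modulation term, and every constant and function name below are assumptions made for the example, since the abstract does not specify them.

```python
import math

# All constants are illustrative assumptions, not values taken from the paper.
A_PLUS = 0.01     # potentiation amplitude (pre spike before post spike)
A_MINUS = 0.012   # depression amplitude (post spike before pre spike)
TAU_PLUS = 20.0   # potentiation time constant, ms
TAU_MINUS = 20.0  # depression time constant, ms


def stdp_window(dt_ms):
    """Exponential STDP window, where dt_ms = t_post - t_pre."""
    if dt_ms > 0:
        return A_PLUS * math.exp(-dt_ms / TAU_PLUS)
    if dt_ms < 0:
        return -A_MINUS * math.exp(dt_ms / TAU_MINUS)
    return 0.0


def stdp_drive(pre_spikes, post_spikes):
    """Sum the STDP window over all pre/post spike pairs (all-to-all pairing)."""
    return sum(stdp_window(t_post - t_pre)
               for t_pre in pre_spikes
               for t_post in post_spikes)


def modulated_update(weight, pre_spikes, post_spikes, performance, baseline,
                     w_min=0.0, w_max=1.0):
    """Scale the raw STDP drive by (performance - baseline), then clip the weight."""
    modulation = performance - baseline  # hypothetical modulation signal
    new_weight = weight + modulation * stdp_drive(pre_spikes, post_spikes)
    return min(w_max, max(w_min, new_weight))
```

In this sketch, a pre-before-post pairing strengthens the synapse only when performance exceeds the baseline. The same pairing weakens it when performance falls below the baseline, and leaves the weight unchanged when the two are equal.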
