☆ 4.5 Article

Reinforcement learning, spike-time-dependent plasticity, and the BCM rule

NEURAL COMPUTATION (2007)

期刊

NEURAL COMPUTATION

卷 19, 期 8, 页码 2245-2279

出版社

MIT PRESS

DOI: 10.1162/neco.2007.19.8.2245

关键词

类别

Computer Science, Artificial Intelligence Neurosciences

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is influenced by an environmental signal, termed a reward, that directs the changes in appropriate directions. We apply a recently introduced policy learning algorithm from machine learning to networks of spiking neurons and derive a spike-time-dependent plasticity rule that ensures convergence to a local optimum of the expected average reward. The approach is applicable to a broad class of neuronal models, including the Hodgkin-Huxley model. We demonstrate the effectiveness of the derived rule in several toy problems. Finally, through statistical analysis, we show that the synaptic plasticity rule established is closely related to the widely used BCM rule, for which good biological evidence exists.

Reinforcement learning, spike-time-dependent plasticity, and the BCM rule

期刊

NEURAL COMPUTATION

出版社

MIT PRESS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Reinforcement learning, spike-time-dependent plasticity, and the BCM rule

期刊

NEURAL COMPUTATION

出版社

MIT PRESS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文