☆ 4.5 Article

A Spiking Neural Network Model of an Actor-Critic Learning Agent

NEURAL COMPUTATION (2009)

期刊

NEURAL COMPUTATION

卷 21, 期 2, 页码 301-339

出版社

MIT PRESS

DOI: 10.1162/neco.2008.08-07-593

关键词

类别

Computer Science, Artificial Intelligence Neurosciences

资金

DIP [F1.2]
BMBF [01GQ0420]
EU [15879]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The ability to adapt behavior to maximize reward as a result of interactions with the environment is crucial for the survival of any higher organism. In the framework of reinforcement learning, temporal-difference learning algorithms provide an effective strategy for such goal-directed adaptation, but it is unclear to what extent these algorithms are compatible with neural computation. In this article, we present a spiking neural network model that implements actor-critic temporal-difference learning by combining local plasticity rules with a global reward signal. The network is capable of solving a nontrivial gridworld task with sparse rewards. We derive a quantitative mapping of plasticity parameters and synaptic weights to the corresponding variables in the standard algorithmic formulation and demonstrate that the network learns with a similar speed to its discrete time counterpart and attains the same equilibrium performance.

A Spiking Neural Network Model of an Actor-Critic Learning Agent

期刊

NEURAL COMPUTATION

出版社

MIT PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A Spiking Neural Network Model of an Actor-Critic Learning Agent

期刊

NEURAL COMPUTATION

出版社

MIT PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文