4.5 Article

Prospective and retrospective temporal difference learning

Journal

NETWORK-COMPUTATION IN NEURAL SYSTEMS
Volume 20, Issue 1, Pages 32-46

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/09548980902759086

Keywords

Emotional processing; reinforcement learning

Funding

  1. Gatsby Charitable Foundation

Abstract

A striking recent finding is that monkeys behave maladaptively in a class of tasks in which they know that reward is going to be systematically delayed. This may be explained by a malign Pavlovian influence arising from states with low predicted values. However, by very carefully analyzing behavioral data from such tasks, La Camera and Richmond (2008) observed the additional important characteristic that subjects perform differently on states in the task that are at equal distances from the future reward, depending on what has happened in the recent past. The authors pointed out that this violates the definition of state value in the standard reinforcement learning models that are ubiquitous as accounts of operant and classical conditioned behavior; they suggested and analyzed an alternative temporal difference (TD) model in which past and future are melded. Here, we show that a standard TD model can in fact exhibit the same behavior, and that this avoids deleterious consequences for choice. At the heart of the model is the average reward per step, which acts as a baseline for measuring immediate rewards. Relatively subtle changes to this baseline occasioned by the past can markedly influence predictions and thus behavior.
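The mechanism highlighted in the abstract, an average reward per step that acts as a baseline against which immediate rewards are measured, corresponds to average-reward TD learning. Below is a minimal tabular sketch of that general update rule, not the paper's exact model; the function name, the learning rates `alpha` and `eta`, and the `(state, reward, next_state)` episode format are illustrative assumptions.

```python
import numpy as np

def average_reward_td(episodes, n_states, alpha=0.1, eta=0.01):
    """Tabular average-reward TD(0) sketch.

    The TD error compares each immediate reward against a running
    estimate of the average reward per step (rho), so relatively
    small shifts in rho, driven by recent history, can change the
    state-value predictions and hence behavior.
    """
    V = np.zeros(n_states)   # differential state values (relative to rho)
    rho = 0.0                # estimate of the average reward per step

    for episode in episodes:
        # each episode is a sequence of (state, reward, next_state) steps
        for s, r, s_next in episode:
            delta = r - rho + V[s_next] - V[s]   # TD error with average-reward baseline
            V[s] += alpha * delta                # update the differential value of s
            rho += eta * delta                   # slowly track the average reward
    return V, rho
```

Because the baseline rho is learned from experience, states equally distant from a future reward can receive different predictions depending on what rewards were recently delivered, which is the retrospective effect the abstract describes.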
