Article

Dopamine: generalization and bonuses

Journal

NEURAL NETWORKS
Volume 15, Issue 4-6, Pages 549-559

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/S0893-6080(02)00048-5

Keywords

dopamine; reinforcement learning; exploration; temporal difference; generalization

In the temporal difference model of primate dopamine neurons, their phasic activity reports a prediction error for future reward. This model is supported by a wealth of experimental data. However, in certain circumstances, the activity of the dopamine cells seems anomalous under the model, as they respond in particular ways to stimuli that are not obviously related to predictions of reward. In this paper, we address two important sets of anomalies, those having to do with generalization and novelty. Generalization responses are treated as the natural consequence of partial information; novelty responses are treated by the suggestion that dopamine cells multiplex information about reward bonuses, including exploration bonuses and shaping bonuses. We interpret this additional role for dopamine in terms of the mechanistic attentional and psychomotor effects of dopamine, having the computational role of guiding exploration. (C) 2002 Published by Elsevier Science Ltd.
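
The abstract gives no equations, so the following is only a minimal sketch of the idea it describes: a temporal difference prediction error to which a bonus is added. The additive form, the 1/(1 + visits) decay of the novelty bonus, the potential-based form of the shaping bonus, and all function and parameter names are assumptions made for illustration, not the paper's own definitions.

```python
def td_error(reward, value_next, value_current, bonus=0.0, gamma=1.0):
    """TD prediction error, with an optional additive bonus (assumed form)."""
    return reward + bonus + gamma * value_next - value_current


def novelty_bonus(visits, scale=1.0):
    """Hypothetical novelty bonus that shrinks as a stimulus becomes familiar."""
    return scale / (1.0 + visits)


def shaping_bonus(potential_next, potential_current, gamma=1.0):
    """Potential-based shaping bonus: gamma * phi(next) - phi(current)."""
    return gamma * potential_next - potential_current


def novelty_responses(n_presentations, scale=1.0):
    """Errors for a repeated, unrewarded stimulus whose values stay at zero."""
    errors = []
    for visits in range(n_presentations):
        bonus = novelty_bonus(visits, scale)
        errors.append(
            td_error(reward=0.0, value_next=0.0, value_current=0.0, bonus=bonus)
        )
    return errors


if __name__ == "__main__":
    # A fully predicted reward with no bonus produces no error.
    print(td_error(reward=1.0, value_next=0.0, value_current=1.0))
    # An unrewarded novel stimulus produces a positive error that fades.
    print(novelty_responses(4))
```

Under these assumptions, a stimulus that predicts no reward still produces a positive error on its first presentations, and that error fades with familiarity, which is the qualitative pattern the abstract attributes to novelty responses.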
