☆ 4.5 Article

Instructional control of reinforcement learning: A behavioral and neurocomputational investigation

BRAIN RESEARCH (2009)

Journal

BRAIN RESEARCH

Volume 1299, Issue -, Pages 74-94

Publisher

ELSEVIER

DOI: 10.1016/j.brainres.2009.07.007

Keywords

Reward; Dopamine; Basal ganglia; Reinforcement learning; Rule-governance

Funding

NIMH NIH HHS [R01 MH080066-01A1, R01 MH080066] Funding Source: Medline

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Humans learn how to behave directly through environmental experience and indirectly through rules and instructions. Behavior analytic research has shown that instructions can control behavior, even when such behavior leads to sub-optimal outcomes (Hayes, S (Ed) 1989. Rule-governed behavior: cognition, contingencies, and instructional control. Plenum Press.). Here we examine the control of behavior through instructions in a reinforcement learning task known to depend on striatal dopaminergic function. Participants selected between probabilistically reinforced stimuli, and were (incorrectly) told that a specific stimulus had the highest (or lowest) reinforcement probability Despite experience to the contrary, instructions drove choice behavior. We present neural network simulations that capture the interactions between instruction-driven and reinforcement-driven behavior via two potential neural circuits one in which the striatum is inaccurately trained by instruction representations coming from prefrontal cortex/hippocampus (PFC/HC), and another in which the striatum learns the environmentally based reinforcement contingencies, but is overridden at decision output. Both models capture the core behavioral phenomena but, because they differ fundamentally on what is learned, make distinct predictions for subsequent behavioral and neuroimaging experiments. Finally, we attempt to distinguish between the proposed computational mechanisms governing instructed behavior by fitting a series of abstract Q-learning and Bayesian models to subject data. The best-fitting model supports one of the neural models, suggesting the existence of a confirmation bias in which the PFC/HC system trains the reinforcement system by amplifying outcomes that are consistent with instructions while diminishing inconsistent outcomes. (C) 2009 Elsevier B.V. All rights reserved

Instructional control of reinforcement learning: A behavioral and neurocomputational investigation

Journal

BRAIN RESEARCH

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Instructional control of reinforcement learning: A behavioral and neurocomputational investigation

Journal

BRAIN RESEARCH

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper