☆ 4.6 Article

Combined model-free and model-sensitive reinforcement learning in non-human primates

PLOS COMPUTATIONAL BIOLOGY (2020)

Journal

PLOS COMPUTATIONAL BIOLOGY

Volume 16, Issue 6, Pages -

Publisher

PUBLIC LIBRARY SCIENCE

DOI: 10.1371/journal.pcbi.1007944

Keywords

Funding

Fundacao para a Ciencia e Tecnologia [SFRH/BD/51711/2011]
Premio Joao Lobo Antunes 2017 - Santa Casa da Misericordia de Lisboa
Astor Foundation
Rostrees Charitable Trust
Wellcome Trust Senior Research Fellowship [WT104765MA]
James S McDonnell Foundation [JSMF220020372]
Gatsby Charitable Foundation
Max Planck Society
Alexander von Humboldt Foundatoin
Wellcome Trust New Investigator Award [096689/Z/11/Z]
Wellcome Trust [096689/Z/11/Z] Funding Source: Wellcome Trust
Wellcome Trust [096689/Z/11/Z] Funding Source: researchfish
Fundação para a Ciência e a Tecnologia [SFRH/BD/51711/2011] Funding Source: FCT

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Contemporary reinforcement learning (RL) theory suggests that potential choices can be evaluated by strategies that may or may not be sensitive to the computational structure of tasks. A paradigmatic model-free (MF) strategy simply repeats actions that have been rewarded in the past; by contrast, model-sensitive (MS) strategies exploit richer information associated with knowledge of task dynamics. MF and MS strategies should typically be combined, because they have complementary statistical and computational strengths; however, this tradeoff between MF/MS RL has mostly only been demonstrated in humans, often with only modest numbers of trials. We trained rhesus monkeys to perform a two-stage decision task designed to elicit and discriminate the use of MF and MS methods. A descriptive analysis of choice behaviour revealed directly that the structure of the task (of MS importance) and the reward history (of MF and MS importance) significantly influenced both choice and response vigour. A detailed, trial-by-trial computational analysis confirmed that choices were made according to a combination of strategies, with a dominant influence of a particular form of model sensitivity that persisted over weeks of testing. The residuals from this model necessitated development of a new combined RL model which incorporates a particular credit assignment weighting procedure. Finally, response vigor exhibited a subtly different collection of MF and MS influences. These results provide new illumination onto RL behavioural processes in non-human primates.

Combined model-free and model-sensitive reinforcement learning in non-human primates

Journal

PLOS COMPUTATIONAL BIOLOGY

Publisher

PUBLIC LIBRARY SCIENCE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Combined model-free and model-sensitive reinforcement learning in non-human primates

Journal

PLOS COMPUTATIONAL BIOLOGY

Publisher

PUBLIC LIBRARY SCIENCE

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper