Journal
JOURNAL OF NEUROPHYSIOLOGY
Volume 115, Issue 6, Pages 3195-3203
Publisher
AMER PHYSIOLOGICAL SOC
DOI: 10.1152/jn.00046.2016
Keywords
observational learning; reward prediction error; reinforcement learning; ventral striatum; fMRI
Funding
- Wellcome Trust
- NIMH Caltech Conte Center for the Neurobiology of Social Decision Making
A major open question is whether computational strategies thought to be used during experiential learning, specifically model-based and model-free reinforcement learning, also support observational learning. Furthermore, the question of how observational learning occurs when observers must learn about the value of options from observing outcomes in the absence of choice has not been addressed. In the present study we used a multi-armed bandit task that encouraged human participants to employ both experiential and observational learning while they underwent functional magnetic resonance imaging (fMRI). We found evidence for the presence of model-based learning signals during both observational and experiential learning in the intraparietal sulcus. However, unlike during experiential learning, model-free learning signals in the ventral striatum were not detectable during this form of observational learning. These results provide insight into the flexibility of the model-based learning system, implicating this system in learning during observation as well as from direct experience, and further suggest that the model-free reinforcement learning system may be less flexible with regard to its involvement in observational learning.