Article

Observing others stay or switch - How social prediction errors are integrated into reward reversal learning

Journal

COGNITION
Volume 153, Issue -, Pages 19-32

Publisher

ELSEVIER SCIENCE BV
DOI: 10.1016/j.cognition.2016.04.012

Keywords

Reversal learning; Social influence; Reward; Prediction error; Similarity

Funding

  1. National Center for Mental Health (Cardiff, Wales)
  2. Economic and Social Research Council of the UK (ESRC) [RES-062-23-0946]
  3. German Research Foundation (DFG) [RES-062-23-0946]
  4. ESRC [ES/F025831/1] Funding Source: UKRI
  5. Economic and Social Research Council [ES/F025831/1, ES/F025831/2] Funding Source: researchfish
  6. Medical Research Council [MR/L010305/1] Funding Source: researchfish

Reward properties of stimuli can undergo sudden changes, and the detection of these 'reversals' is often made difficult by the probabilistic nature of rewards/punishments. Here we tested whether and how humans use social information (someone else's choices) to overcome uncertainty during reversal learning. We show a substantial social influence during reversal learning, which was modulated by the type of observed behavior. Participants frequently followed observed conservative choices (no switches after punishment) made by the (fictitious) other player but ignored impulsive choices (switches), even though the experiment was set up so that both types of response behavior would be similarly beneficial/detrimental (Study 1). Computational modeling showed that participants integrated the observed choices as a 'social prediction error' instead of ignoring or blindly following the other player. Modeling also confirmed higher learning rates for 'conservative' versus 'impulsive' social prediction errors. Importantly, this 'conservative bias' was boosted by interpersonal similarity, which in conjunction with the lack of effects observed in a non-social control experiment (Study 2) confirmed its social nature. A third study suggested that relative weighting of observed impulsive responses increased with increased volatility (frequency of reversals). Finally, simulations showed that in the present paradigm integrating social and reward information was not necessarily more adaptive to maximize earnings than learning from reward alone. Moreover, integrating social information increased accuracy only when conservative and impulsive choices were weighted similarly during learning. These findings suggest that to guide decisions in choice contexts that involve reward reversals humans utilize social cues conforming with their preconceptions more strongly than cues conflicting with them, especially when the other is similar. © 2016 The Authors. Published by Elsevier B.V.
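The abstract does not give the model equations, so the following is only an illustrative sketch of how a 'social prediction error' with separate learning rates for conservative and impulsive observations might be combined with ordinary reward learning. It assumes a simple Rescorla-Wagner-style value update for two options; the class name, method names, and all parameter values are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class SocialRWLearner:
    """Hypothetical two-option learner; all default values are assumptions."""
    alpha_reward: float = 0.3        # learning rate for reward prediction errors
    alpha_conservative: float = 0.4  # learning rate when the other player stays
    alpha_impulsive: float = 0.1     # learning rate when the other player switches

    def __post_init__(self):
        # Start with equal value estimates for both options.
        self.values = [0.5, 0.5]

    def update_reward(self, choice, reward):
        """Move the chosen option's value toward the received outcome."""
        delta = reward - self.values[choice]
        self.values[choice] += self.alpha_reward * delta
        return delta

    def update_social(self, observed_choice, other_switched):
        """Treat the other player's choice as evidence for that option.

        The social prediction error is the gap between full endorsement (1.0)
        and the learner's current value for the observed option. Its weight
        depends on whether the other player stayed or switched.
        """
        alpha = self.alpha_impulsive if other_switched else self.alpha_conservative
        delta = 1.0 - self.values[observed_choice]
        self.values[observed_choice] += alpha * delta
        return delta


learner = SocialRWLearner()
learner.update_reward(choice=0, reward=0.0)                   # punished on option 0
learner.update_social(observed_choice=0, other_switched=False)  # other player stays
print(learner.values)
```

With a higher `alpha_conservative` than `alpha_impulsive`, an observed stay pulls the value estimate further than an observed switch, which is one way to express the 'conservative bias' described above. Setting the two rates equal corresponds to weighting both kinds of observation similarly.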
