4.6 Article

Local and global stimuli in reinforcement learning

期刊

NEW JOURNAL OF PHYSICS
卷 23, 期 8, 页码 -

出版社

IOP Publishing Ltd
DOI: 10.1088/1367-2630/ac170a

关键词

reinforcement learning; local and global stimuli; conditional cooperation; moody conditional cooperation

资金

  1. National Natural Science Foundation for Distinguished Young Scholars [62025602]
  2. National Natural Science Foundation of China [U1803263, 11931015, 81961138010]
  3. National Key R&D Program of China [2019YFB2102304, 2018YFB1403501]
  4. Fok Ying-Tong Education Foundation, China [171105]
  5. Key Technology Research and Development Program of Science and Technology-Scientific and Technological Innovation Team of Shaanxi Province [2020TD-013]
  6. Key Area R&D Program of Guangdong Province [2019B010137004]
  7. Slovenian Research Agency [P1-0403, J1-2457, J1-9112]
  8. National Natural Science Foundation of China (NNSFC) [11931015, 11671348]

向作者/读者索取更多资源

Reinforcement learning is an alternative to imitation and exploration in resolving social dilemmas, where individuals adjust strategies based on their own past performance and preset aspirations. Stimuli play a crucial role in determining whether a strategy should be retained.
In efforts to resolve social dilemmas, reinforcement learning is an alternative to imitation and exploration in evolutionary game theory. While imitation and exploration rely on the performance of neighbors, in reinforcement learning individuals alter their strategies based on their own performance in the past. For example, according to the Bush-Mosteller model of reinforcement learning, an individual's strategy choice is driven by whether the received payoff satisfies a preset aspiration or not. Stimuli also play a key role in reinforcement learning in that they can determine whether a strategy should be kept or not. Here we use the Monte Carlo method to study pattern formation and phase transitions towards cooperation in social dilemmas that are driven by reinforcement learning. We distinguish local and global players according to the source of the stimulus they experience. While global players receive their stimuli from the whole neighborhood, local players focus solely on individual performance. We show that global players play a decisive role in ensuring cooperation, while local players fail in this regard, although both types of players show properties of 'moody cooperators'. In particular, global players evoke stronger conditional cooperation in their neighborhoods based on direct reciprocity, which is rooted in the emerging spatial patterns and stronger interfaces around cooperative clusters.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据