☆ 4.6 Article

Local and global stimuli in reinforcement learning

NEW JOURNAL OF PHYSICS (2021)

期刊

NEW JOURNAL OF PHYSICS

卷 23, 期 8, 页码 -

出版社

IOP Publishing Ltd

DOI: 10.1088/1367-2630/ac170a

关键词

reinforcement learning; local and global stimuli; conditional cooperation; moody conditional cooperation

类别

Physics, Multidisciplinary

资金

National Natural Science Foundation for Distinguished Young Scholars [62025602]
National Natural Science Foundation of China [U1803263, 11931015, 81961138010]
National Key R&D Program of China [2019YFB2102304, 2018YFB1403501]
Fok Ying-Tong Education Foundation, China [171105]
Key Technology Research and Development Program of Science and Technology-Scientific and Technological Innovation Team of Shaanxi Province [2020TD-013]
Key Area R&D Program of Guangdong Province [2019B010137004]
Slovenian Research Agency [P1-0403, J1-2457, J1-9112]
National Natural Science Foundation of China (NNSFC) [11931015, 11671348]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Reinforcement learning is an alternative to imitation and exploration in resolving social dilemmas, where individuals adjust strategies based on their own past performance and preset aspirations. Stimuli play a crucial role in determining whether a strategy should be retained.

In efforts to resolve social dilemmas, reinforcement learning is an alternative to imitation and exploration in evolutionary game theory. While imitation and exploration rely on the performance of neighbors, in reinforcement learning individuals alter their strategies based on their own performance in the past. For example, according to the Bush-Mosteller model of reinforcement learning, an individual's strategy choice is driven by whether the received payoff satisfies a preset aspiration or not. Stimuli also play a key role in reinforcement learning in that they can determine whether a strategy should be kept or not. Here we use the Monte Carlo method to study pattern formation and phase transitions towards cooperation in social dilemmas that are driven by reinforcement learning. We distinguish local and global players according to the source of the stimulus they experience. While global players receive their stimuli from the whole neighborhood, local players focus solely on individual performance. We show that global players play a decisive role in ensuring cooperation, while local players fail in this regard, although both types of players show properties of 'moody cooperators'. In particular, global players evoke stronger conditional cooperation in their neighborhoods based on direct reciprocity, which is rooted in the emerging spatial patterns and stronger interfaces around cooperative clusters.

Local and global stimuli in reinforcement learning

期刊

NEW JOURNAL OF PHYSICS

出版社

IOP Publishing Ltd

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Local and global stimuli in reinforcement learning

期刊

NEW JOURNAL OF PHYSICS

出版社

IOP Publishing Ltd

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文