4.7 Article

Inferring strategies from observations in long iterated Prisoner's dilemma experiments

期刊

SCIENTIFIC REPORTS
卷 12, 期 1, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41598-022-11654-2

关键词

-

资金

  1. Flemish Government through the AI Research Program
  2. TAILOR - EU [952215]
  3. F.N.R.S Charge de Recherche position [40005955]
  4. FWO -Research Foundation Flanders
  5. F.N.R.S. project [31257234]
  6. F.W.O. project [G.0391.13N]
  7. FuturICT 2.0 - FLAG-ERA JCT
  8. Service Public de Wallonie Recherche [2010235-ARIAC]

向作者/读者索取更多资源

In the Iterated Prisoner's dilemma, fixed partner interaction leads to behavioral self-organization, while shuffled partners generate subgroups of memory-one strategies that block the self-selection process. Furthermore, longer treatments are needed to accurately capture the learning phase participants go through.
While many theoretical studies have revealed the strategies that could lead to and maintain cooperation in the Iterated Prisoner's dilemma, less is known about what human participants actually do in this game and how strategies change when being confronted with anonymous partners in each round. Previous attempts used short experiments, made different assumptions of possible strategies, and led to very different conclusions. We present here two long treatments that differ in the partner matching strategy used, i.e. fixed or shuffled partners. Here we use unsupervised methods to cluster the players based on their actions and then Hidden Markov Model to infer what the memory-one strategies are in each cluster. Analysis of the inferred strategies reveals that fixed partner interaction leads to behavioral self-organization. Shuffled partners generate subgroups of memory-one strategies that remain entangled, apparently blocking the self-selection process that leads to fully cooperating participants in the fixed partner treatment. Analyzing the latter in more detail shows that AllC, AllD, TFT- and WSLS-like behavior can be observed. This study also reveals that long treatments are needed as experiments with less than 25 rounds capture mostly the learning phase participants go through in these kinds of experiments.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据