4.6 Article

Introspection dynamics: a simple model of counterfactual learning in asymmetric games

期刊

NEW JOURNAL OF PHYSICS
卷 24, 期 6, 页码 -

出版社

IOP Publishing Ltd
DOI: 10.1088/1367-2630/ac6f76

关键词

evolutionary game theory; counterfactual learning; myopic updating; asymmetric games; social dilemmas; volunteer's dilemma

资金

  1. European Research Council [850529]
  2. Max Planck Society
  3. European Research Council (ERC) [850529] Funding Source: European Research Council (ERC)

向作者/读者索取更多资源

Social behavior can be studied as an evolutionary process, where individuals make decisions between different strategies and those strategies that give a fitness advantage tend to spread. Traditional evolutionary game theory focuses on symmetric games, but interactions are often asymmetric. Therefore, we propose an alternative learning process called introspection dynamics that applies to arbitrary asymmetric games.
Social behavior in human and animal populations can be studied as an evolutionary process. Individuals often make decisions between different strategies, and those strategies that yield a fitness advantage tend to spread. Traditionally, much work in evolutionary game theory considers symmetric games: individuals are assumed to have access to the same set of strategies, and they experience the same payoff consequences. As a result, they can learn more profitable strategies by imitation. However, interactions are oftentimes asymmetric. In that case, imitation may be infeasible (because individuals differ in the strategies they are able to use), or it may be undesirable (because individuals differ in their incentives to use a strategy). Here, we consider an alternative learning process which applies to arbitrary asymmetric games, introspection dynamics. According to this dynamics, individuals regularly compare their present strategy to a randomly chosen alternative strategy. If the alternative strategy yields a payoff advantage, it is more likely adopted. In this work, we formalize introspection dynamics for pairwise games. We derive simple and explicit formulas for the abundance of each strategy over time and apply these results to several well-known social dilemmas. In particular, for the volunteer's timing dilemma, we show that the player with the lowest cooperation cost learns to cooperate without delay.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据