在收益抽样下封闭的策略集

Strategy sets closed under payoff sampling

Games and Economic Behavior · 2022

被引 8

人大 AABS 3

Segismundo S. Izquierdo · 巴利亚多利德大学通讯
Luis R. Izquierdo · 布尔戈斯大学

中文导读

研究了程序理性玩家通过随机匹配测试各策略并选择最优的收益抽样动态，刻画了该动态在所有博弈中静止点的支撑集，并分析了其所属面的渐近稳定性。

Abstract

We consider population games played by procedurally rational players who, when revising their current strategy, test each of their available strategies independently in a series of random matches –i.e., a battery of tests–, and then choose the strategy that performed best in this battery of tests. This revision protocol leads to the so-called payoff-sampling dynamics (aka test-all Best Experienced Payoff dynamics). In this paper we characterize the support of all the rest points of these dynamics in any game and analyze the asymptotic stability of the faces to which they belong. We do this by defining strategy sets closed under payoff sampling, and by proving that the identification of these sets can be made in terms of simple comparisons between some of the payoffs of the game.

策略集收益抽样不动点渐近稳定性

阅读原文 ↗