实验性p-选美比赛中的迭代占优与迭代最优反应

Iterated Dominance and Iterated Best-Response in Experimental P-Beauty Contests

American Economic Review · 1996
被引 488
人大 A+FT50ABS 4*

中文导读

通过p-选美比赛实验,研究玩家迭代推理的步数,发现中位步数为1或2,重复游戏收敛到纳什均衡,且最优反应模型比学习方向理论更能解释后期选择。

Abstract

We study a dominance-solvable 'p-beauty contest' game in which a group of players simultaneously choose numbers from a closed interval. The winner is the player whose number is the closest top times the average, where p =/ 1. The numbers players choose can be taken as an indication of the number of steps of iterated reasoning about others they do. Choices in the first period show that the median number of steps of iterated reasoning is either one or two. Repeating the game produces reliable convergence to the unique Nash equilibrium. Choices in later periods are consistent with subjects' best-responding to previous choices, or iterating one step and best-responding to best responses. (Choices are not as consistent with 'learning direction theory' which embodies elements of belief-free reinforcement models). Variation in the values of p, the number of players, and whether subjects played a similar game before, all affect choices and learning.

迭代优势迭代最优反应纳什均衡收敛