单臂赌博机问题中的社会学习

Social Learning in One-Arm Bandit Problems

Econometrica · 2007
被引 84
人大 A+FT50ABS 4*

中文导读

研究两个玩家在离散时间下的单臂赌博机问题,其中风险臂有高低两种类型,停止实验不可逆,玩家观察彼此行动但不观察收益,证明所有均衡均为截断策略并给出截断序列的定性结果。

Abstract

We study a two-player one-arm bandit problem in discrete time, in which the risky arm can have two possible types, high and low, the decision to stop experimenting is irreversible, and players observe each other's actions but not each other's payoffs. We prove that all equilibria are in cutoff strategies and provide several qualitative results on the sequence of cutoffs. Copyright The Econometric Society 2007.

社会学习单臂老虎机均衡策略截断策略