Social Learning in One-Arm Bandit Problems
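This paper studies a two-player one-arm bandit problem in discrete time in which the risky arm is of either a high or a low type, stopping experimentation is irreversible, and players observe each other's actions but not each other's payoffs. It proves that all equilibria are in cutoff strategies and gives qualitative results on the sequence of cutoffs.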
We study a two-player one-arm bandit problem in discrete time, in which the risky arm can have two possible types, high and low, the decision to stop experimenting is irreversible, and players observe each other's actions but not each other's payoffs. We prove that all equilibria are in cutoff strategies and provide several qualitative results on the sequence of cutoffs. Copyright The Econometric Society 2007.
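To make the notion of a cutoff strategy concrete, the following is a minimal single-player sketch, not the paper's model or its equilibrium construction. It assumes, purely for illustration, that the risky arm pays Bernoulli rewards with success probability `q_high` or `q_low` depending on its type, and that the player stops irreversibly the first time her posterior belief that the arm is high falls below a stage-dependent cutoff. The function names, the Bernoulli payoffs, and the numerical values are all hypothetical, and the information conveyed by the other player's actions is deliberately left out.

```python
def update_belief(p, success, q_high, q_low):
    """Bayes update of P(type = high) after one Bernoulli payoff.

    q_high and q_low are assumed success probabilities for the two types
    (an illustrative payoff structure, not taken from the paper).
    """
    like_high = q_high if success else 1.0 - q_high
    like_low = q_low if success else 1.0 - q_low
    num = p * like_high
    return num / (num + (1.0 - p) * like_low)


def stopping_time(prior, outcomes, cutoffs, q_high=0.7, q_low=0.3):
    """Return the first stage at which the belief is below that stage's cutoff.

    Returns None if the player never stops within the observed horizon.
    Stopping is irreversible, so the loop ends at the first such stage.
    """
    p = prior
    for n, success in enumerate(outcomes):
        if p < cutoffs[n]:
            return n  # drop out before playing stage n
        p = update_belief(p, success, q_high, q_low)
    return None
```

For example, with a prior of 0.5, a constant cutoff of 0.4, and a failure in the first stage, the belief falls to 0.3 and the player drops out at stage 1. In the two-player setting of the paper, the belief would also respond to whether the other player is still experimenting, which this sketch does not model.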