Reputation and Equilibrium Selection in Games with a Patient Player
研究一个长期玩家与一系列短期对手的重复博弈,证明若对手认为长期玩家可能坚持Stackelberg策略,则长期玩家的纳什均衡收益会接近Stackelberg收益。
A single long-run player plays a simultaneous-move stage game against a sequence of opponents who play only once, but observe all previous play. Let the be the pure strategy to which the long-run player would most like to commit himself. If there is positive prior probability that the long-run player will always play the Stackelberg strategy, then his payoff in any Nash equilibrium exceeds a bound that converges to the Stackelberg payoff as his discount factor approaches one. When the stage game is not simultaneous move, this result must be modified to account for the possibility that distinct strategies of the long-run player are observationally equivalent.