遗憾测试：在不知有对手的情况下学习纳什均衡

Regret Testing: Learning to Play Nash Equilibrium Without Knowing You Have an Opponent

Theoretical Economics · 2006

被引 168

人大 AABS 4

Dean P. Foster
H. Peyton Young

中文导读

提出一类简单且完全解耦的学习规则，玩家无需知道对手的收益或行动，就能在有限两人博弈中逐步逼近纳什均衡行为。

Abstract

A learning rule is uncoupled if a player does not condition his strategy on the opponent's payoffs. It is radically uncoupled if a player does not condition his strategy on the opponent's actions or payoffs. We demonstrate a family of simple, radically uncoupled learning rules whose period-by-period behavior comes arbitrarily close to Nash equilibrium behavior in any finite two-person game.

后悔测试无耦合学习纳什均衡有限二人博弈