Regret Testing: Learning to Play Nash Equilibrium Without Knowing You Have an Opponent

Theoretical Economics · 2006
Cited by: 168
Journal rankings: Renmin University A · ABS 4

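Summary

The paper proposes a family of simple, radically uncoupled learning rules under which players, without knowing the opponent's payoffs or actions, come progressively closer to Nash equilibrium behavior in finite two-person games.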

Abstract

A learning rule is uncoupled if a player does not condition his strategy on the opponent's payoffs. It is radically uncoupled if a player does not condition his strategy on the opponent's actions or payoffs. We demonstrate a family of simple, radically uncoupled learning rules whose period-by-period behavior comes arbitrarily close to Nash equilibrium behavior in any finite two-person game.
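
To make "radically uncoupled" concrete, the following is a minimal sketch of a rule in the spirit of regret testing, not the paper's exact specification. Each player sees only its own actions and realized payoffs. It plays a fixed mixed strategy for a period, occasionally experiments with a random action, and at the end of the period compares the best experimental average payoff with its baseline average. If the gap (the estimated regret) exceeds a tolerance, it redraws its strategy at random. The class name `RegretTester`, the parameters `period`, `explore`, and `tolerance`, their default values, the uniform redraw from the simplex, and the assumption that payoffs lie in [0, 1] are all illustrative choices.

```python
import random


def sample_simplex(n, rng):
    """Draw a mixed strategy uniformly at random from the probability simplex."""
    cuts = sorted(rng.random() for _ in range(n - 1))
    bounds = [0.0] + cuts + [1.0]
    return [bounds[i + 1] - bounds[i] for i in range(n)]


class RegretTester:
    """One player. It never observes the opponent's actions or payoffs."""

    def __init__(self, n_actions, period=500, explore=0.1, tolerance=0.1, seed=None):
        self.n = n_actions
        self.period = period        # plays between regret tests (assumed value)
        self.explore = explore      # per-play experimentation probability (assumed)
        self.tolerance = tolerance  # regret level that triggers a redraw (assumed)
        self.rng = random.Random(seed)
        self.strategy = sample_simplex(n_actions, self.rng)
        self._reset()

    def _reset(self):
        self.base_sum, self.base_n = 0.0, 0
        self.exp_sum = [0.0] * self.n
        self.exp_n = [0] * self.n
        self.t = 0

    def act(self):
        # With small probability, experiment with a uniformly random action;
        # otherwise sample from the current mixed strategy.
        self.experimenting = self.rng.random() < self.explore
        if self.experimenting:
            self.action = self.rng.randrange(self.n)
        else:
            self.action = self.rng.choices(range(self.n), weights=self.strategy)[0]
        return self.action

    def observe(self, payoff):
        # Record only the player's own realized payoff.
        if self.experimenting:
            self.exp_sum[self.action] += payoff
            self.exp_n[self.action] += 1
        else:
            self.base_sum += payoff
            self.base_n += 1
        self.t += 1
        if self.t == self.period:
            self._test()

    def _test(self):
        # Estimated regret: best experimental average minus baseline average.
        estimates = [self.exp_sum[a] / self.exp_n[a]
                     for a in range(self.n) if self.exp_n[a] > 0]
        if self.base_n > 0 and estimates:
            regret = max(estimates) - self.base_sum / self.base_n
            if regret > self.tolerance:
                self.strategy = sample_simplex(self.n, self.rng)
        self._reset()


def play(payoffs, row, col, rounds):
    """payoffs[i][j] = (row payoff, column payoff), each assumed to lie in [0, 1]."""
    for _ in range(rounds):
        i, j = row.act(), col.act()
        u_row, u_col = payoffs[i][j]
        row.observe(u_row)
        col.observe(u_col)


if __name__ == "__main__":
    matching_pennies = [[(1.0, 0.0), (0.0, 1.0)],
                        [(0.0, 1.0), (1.0, 0.0)]]
    row = RegretTester(2, seed=1)
    col = RegretTester(2, seed=2)
    play(matching_pennies, row, col, rounds=20_000)
    print(row.strategy, col.strategy)
```

Note that `play` passes each player only its own payoff, so neither player's update depends on the opponent's actions or payoffs. From a single player's point of view the opponent is indistinguishable from a noisy environment, which is the sense of "without knowing you have an opponent" in the title.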

regret testing · uncoupled learning · Nash equilibrium · finite two-person games