连续博弈中利用最少信息的学习

Learning with minimal information in continuous games

Theoretical Economics · 2020

被引 6

人大 AABS 4

Sebastian Bervoets
Mathieu Faure
Mario Bravo · 智利圣地亚哥大学

中文导读

针对连续动作集博弈，设计了一种简单的随机学习规则，玩家仅根据自身收益变化更新动作，无需复杂推理，并证明在策略互补博弈、凹博弈和局部序势博弈中收敛到纳什均衡。

Abstract

While payoff‐based learning models are almost exclusively devised for finite action games, where players can test every action, it is harder to design such learning processes for continuous games. We construct a stochastic learning rule, designed for games with continuous action sets, which requires no sophistication from the players and is simple to implement: players update their actions according to variations in own payoff between current and previous action. We then analyze its behavior in several classes of continuous games and show that convergence to a stable Nash equilibrium is guaranteed in all games with strategic complements as well as in concave games, while convergence to Nash equilibrium occurs in all locally ordinal potential games as soon as Nash equilibria are isolated.

连续博弈最小信息学习随机学习规则纳什均衡收敛

阅读原文 ↗