Learning Dynamics in Games with Stochastic Perturbations

Games and Economic Behavior · 1995

Cited by 123

Renmin University of China · AABS 3

Yuri M. Kaniovski · Johns Hopkins University
H. Peyton Young · Johns Hopkins University

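Overview

The paper studies a generalized form of fictitious play in which agents' choices are perturbed by incomplete information, payoff variability, and random trembles, giving rise to a nonstationary Markov process. Using stochastic approximation theory, it proves that in 2 × 2 games the process converges almost surely to a point near a stable Nash equilibrium, generalizing a result of Fudenberg and Kreps.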

Abstract

Consider a generalization of fictitious play in which agents' choices are perturbed by incomplete information about what the other side has done, variability in their payoffs, and unexplained trembles. These perturbed best reply dynamics define a nonstationary Markov process on an infinite state space. It is shown, using results from stochastic approximation theory, that for 2 × 2 games it converges almost surely to a point that lies close to a stable Nash equilibrium, whether pure or mixed. This generalizes a result of Fudenberg and Kreps, who demonstrate convergence when the game has a unique mixed equilibrium. Journal of Economic Literature Classification Numbers: 000, 000, 000.
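
The dynamics described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' model: it assumes one simple form of each perturbation (each agent draws a small random sample, with replacement, from the opponent's past play and best-replies to it, and with a small probability trembles to a uniformly random action). The function names, payoff matrix, sample size, tremble rate, and initial counts are all hypothetical choices made for illustration.

```python
import random

def best_reply(payoff, opp_freq0):
    """Return the action (0 or 1) with the higher expected payoff when the
    opponent is believed to play action 0 with probability opp_freq0.
    payoff[own_action][opponent_action] is the agent's own payoff."""
    u0 = payoff[0][0] * opp_freq0 + payoff[0][1] * (1 - opp_freq0)
    u1 = payoff[1][0] * opp_freq0 + payoff[1][1] * (1 - opp_freq0)
    return 0 if u0 >= u1 else 1

def perturbed_choice(payoff, opp_counts, sample_size, tremble, rng):
    """One perturbed best reply (assumed form of the perturbations)."""
    # Tremble: with small probability, ignore payoffs and act at random.
    if rng.random() < tremble:
        return rng.randrange(2)
    # Incomplete information: observe only a random sample of past play.
    freq0 = opp_counts[0] / (opp_counts[0] + opp_counts[1])
    hits = sum(rng.random() < freq0 for _ in range(sample_size))
    return best_reply(payoff, hits / sample_size)

def simulate(row_payoff, col_payoff, periods=5000, sample_size=5,
             tremble=0.05, seed=0):
    """Run the perturbed process and return the empirical frequency of
    action 0 for the row player and for the column player."""
    rng = random.Random(seed)
    row_counts = [5, 5]  # assumed initial history: five plays of each action
    col_counts = [5, 5]
    for _ in range(periods):
        a = perturbed_choice(row_payoff, col_counts, sample_size, tremble, rng)
        b = perturbed_choice(col_payoff, row_counts, sample_size, tremble, rng)
        row_counts[a] += 1
        col_counts[b] += 1
    return row_counts[0] / sum(row_counts), col_counts[0] / sum(col_counts)

# Illustrative 2 x 2 coordination game: both agents prefer to match actions,
# and matching on action 0 pays more than matching on action 1.
COORD = [[2, 0], [0, 1]]

if __name__ == "__main__":
    p_row, p_col = simulate(COORD, COORD)
    print(f"frequency of action 0: row={p_row:.3f}, col={p_col:.3f}")
```

Under these assumptions the empirical frequencies settle near, but not exactly at, a pure equilibrium of the coordination game, because trembles keep a small share of play on the other action. This matches the abstract's claim only qualitatively: the limit point lies close to a stable Nash equilibrium rather than on it.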

Keywords: stochastic perturbations, fictitious play, learning dynamics, Nash equilibrium
