学习模型中的平均行为

Average behavior in learning models

Journal of Economic Theory · 1992
被引 43
人大 AABS 4

中文导读

研究一类适应性行为模型,其中遥远过去对当前行为影响微弱,且代理人偶尔犯错。结果表明,平均行为几乎必然收敛到唯一极限,错误作为均衡选择机制,在错误概率小时接近无错误模型的均衡。

Abstract

We examine a general class of adaptive behavior models in which the distant past has only a weak effect on current actions, and assume that agents sometimes make mistakes, to show that average behavior (averaged over time) converges, with probability one, to a unique limit. Mistakes generate global convergence and are an equilibrium selection device; for small mistake probabilities the equilibrium selected is close to an equilibrium of the model without mistakes. The overlapping generations model, and learning in games with bounded memory, fit into this framework and are examined as examples of the result.

平均行为收敛均衡选择学习模型有限记忆博弈