带噪声的方向性学习与对数均衡

Noisy Directional Learning and the Logit Equilibrium

Scandinavian Journal of Economics · 2004
被引 66
人大 A-ABS 3

中文导读

构建了一个动态模型,其中代理人受正态误差影响,向更高收益方向调整决策,该过程由福克-普朗克方程描述,并在所有势博弈中稳定。均衡时,决定期望收益的分布与对数函数应用于这些期望收益产生的分布一致,形成纳什均衡的随机推广,可解释实验室异常数据。

Abstract

Abstract We specify a dynamic model in which agents adjust their decisions toward higher payoffs, subject to normal error. This process generates a probability distribution of players’ decisions that evolves over time according to the Fokker–Planck equation. The dynamic process is stable for all potential games, a class of payoff structures that includes several widely studied games. In equilibrium, the distributions that determine expected payoffs correspond to the distributions that arise from the logit function applied to those expected payoffs. This “logit equilibrium” forms a stochastic generalization of the Nash equilibrium and provides a possible explanation of anomalous laboratory data.

Logit均衡噪声方向学习势博弈