实验性非对称信息博弈中的强化学习模型与信念学习模型比较

Reinforcement-based vs. Belief-based Learning Models in Experimental Asymmetric-information Games

Econometrica · 2000

被引 170

人大 A+FT50ABS 4*

Nick Feltovich · 休斯敦大学通讯

中文导读

通过新实验比较强化学习模型和信念学习模型在描述非对称信息博弈中受试者行为的能力，发现两者均优于纳什均衡，但优劣取决于实验和评判标准。

Abstract

This paper examines the abilities of learning models to describe subject behavior in experiments. A new experiment involving multistage asymmetric-information games is conducted, and the experimental data are compared with the predictions of Nash equilibrium and two types of learning model: a reinforcement-based model similar to that used by Roth and Erev (1995), and belief-based models similar to the ‘cautious fictitious play’ of Fudenberg and Levine (1995, 1998) These models make predictions that are qualitatively similar cycling around the Nash equilibrium that is much more apparent than movement toward it. While subject behavior is not adequately described by Nash equilibrium, it is consistent with the qualitative predictions of the learning models. We examine several criteria for quantitatively comparing the predictions of alternative models. According to almost all of these criteria, both types of learning model outperform Nash equilibrium. According to some criteria, the reinforcement-based model performs better than any version of the belief-based model; according to others, there exist versions of the belief-based model that outperform the reinforcement-based model. The abilities of these models are further tested with respect to the results of other published experiments. The relative performance of the two learning models depends on the experiment, and varies according to which criterion of success is used. Again, both models perform better than equilibrium in most cases.

强化学习信念学习非对称信息博弈实验经济学

阅读原文 ↗