随机博弈的边际值

Marginal Values of a Stochastic Game

Mathematics of Operations Research · 2024
被引 1
ABS 3

中文导读

研究了零和随机博弈中贴现值和未贴现值的边际值(即值函数沿扰动的方向导数),给出了贴现情形的公式,并在温和假设下给出了贴现率趋于零时的极限公式以及未贴现情形的公式,发现后两者一般不同。

Abstract

Zero-sum stochastic games are parameterized by payoffs, transitions, and possibly a discount rate. In this article, we study how the main solution concepts, the discounted and undiscounted values, vary when these parameters are perturbed. We focus on the marginal values, introduced by Mills in 1956 in the context of matrix games—that is, the directional derivatives of the value along any fixed perturbation. We provide a formula for the marginal values of a discounted stochastic game. Further, under mild assumptions on the perturbation, we provide a formula for their limit as the discount rate vanishes and for the marginal values of an undiscounted stochastic game. We also show, via an example, that the two latter differ in general. Funding: This work was supported by Fondation CFM pour la Recherche; the European Research Council [Grant ERC-CoG-863818 (ForM-SMArt)]; and Agence Nationale de la Recherche [Grant ANR-21-CE40-0020].

博弈论随机博弈数学优化经济学