马尔可夫PERT网络上的阻断博弈

Interdiction Games on Markovian PERT Networks

Management Science · 2014

被引 26

人大 A+FT50UTD24ABS 4*

Eli Gutin · 麻省理工学院
Daniel Kühn · 洛桑联邦理工学院
Wolfram Wiesemann · 帝国理工学院

中文导读

研究了随机阻断博弈中，扩散者最小化核武器项目期望工期、阻断者通过延迟任务最大化工期的静态和动态决策问题，并推广到考虑行动结果不确定性和加速博弈的情形。

Abstract

In a stochastic interdiction game a proliferator aims to minimize the expected duration of a nuclear weapons development project, and an interdictor endeavors to maximize the project duration by delaying some of the project tasks. We formulate static and dynamic versions of the interdictor’s decision problem where the interdiction plan is either precommitted or adapts to new information revealed over time, respectively. The static model gives rise to a stochastic program, whereas the dynamic model is formalized as a multiple optimal stopping problem in continuous time and with decision-dependent information. Under a memoryless probabilistic model for the task durations, we prove that the static model reduces to a mixed-integer linear program, whereas the dynamic model reduces to a finite Markov decision process in discrete time that can be solved via efficient value iteration. We then generalize the dynamic model to account for uncertainty in the outcomes of the interdiction actions. We also discuss a crashing game where the proliferator can use limited resources to expedite tasks so as to counterbalance the interdictor’s efforts. The resulting problem can be formulated as a robust Markov decision process. This paper was accepted by Dimitris Bertsimas, optimization.

随机博弈马尔可夫PERT网络最优停止鲁棒马尔可夫决策过程

阅读原文 ↗