Interdiction Games on Markovian PERT Networks
研究了随机阻断博弈中,扩散者最小化核武器项目期望工期、阻断者通过延迟任务最大化工期的静态和动态决策问题,并推广到考虑行动结果不确定性和加速博弈的情形。
In a stochastic interdiction game a proliferator aims to minimize the expected duration of a nuclear weapons development project, and an interdictor endeavors to maximize the project duration by delaying some of the project tasks. We formulate static and dynamic versions of the interdictor’s decision problem where the interdiction plan is either precommitted or adapts to new information revealed over time, respectively. The static model gives rise to a stochastic program, whereas the dynamic model is formalized as a multiple optimal stopping problem in continuous time and with decision-dependent information. Under a memoryless probabilistic model for the task durations, we prove that the static model reduces to a mixed-integer linear program, whereas the dynamic model reduces to a finite Markov decision process in discrete time that can be solved via efficient value iteration. We then generalize the dynamic model to account for uncertainty in the outcomes of the interdiction actions. We also discuss a crashing game where the proliferator can use limited resources to expedite tasks so as to counterbalance the interdictor’s efforts. The resulting problem can be formulated as a robust Markov decision process. This paper was accepted by Dimitris Bertsimas, optimization.