Monitoring with Limited Information

Management Science · 2020
Cited by: 10
Journal rankings: Renmin University of China A+, FT50, UTD24, ABS 4*

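Summary

The paper studies how a decision maker who can observe the system state only a limited number of times should choose the monitoring times and the stopping policy to maximize reward. It proposes a robust optimization approach and validates its effectiveness on the monitoring of heart-transplant patients.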

Abstract

We consider a system with an evolving state that can be stopped at any time by a decision maker (DM), yielding a state-dependent reward. The DM does not observe the state except for a limited number of monitoring times, which he must choose, in conjunction with a suitable stopping policy, to maximize his reward. Dealing with these types of stopping problems, which arise in a variety of applications from healthcare to finance, often requires excessive amounts of data for calibration purposes and prohibitive computational resources. To overcome these challenges, we propose a robust optimization approach, whereby adaptive uncertainty sets capture the information acquired through monitoring. We consider two versions of the problem—static and dynamic—depending on how the monitoring times are chosen. We show that, under certain conditions, the same worst-case reward is achievable under either static or dynamic monitoring. This allows recovering the optimal dynamic monitoring policy by resolving static versions of the problem. We discuss cases when the static problem becomes tractable and highlight conditions when monitoring at equidistant times is optimal. Lastly, we showcase our framework in the context of a healthcare problem (monitoring heart-transplant patients for cardiac allograft vasculopathy), where we design optimal monitoring policies that substantially improve over the status quo recommendations. This paper was accepted by Chung Piaw Teo, optimization.

Keywords: monitoring with limited information; robust optimization; optimal stopping policy; adaptive uncertainty sets