大规模服务系统中的服务中断

Service Interruptions in Large-Scale Service Systems

Management Science · 2009
被引 35
人大 A+FT50UTD24ABS 4*

中文导读

研究大规模服务系统中服务中断的影响,发现规模越大,系统在中断时越脆弱,恢复时间越长,并通过流体模型量化了这种影响。

Abstract

Large-scale service systems, where many servers respond to high demand, are appealing because they can provide great economy of scale, producing a high quality of service with high efficiency. Customer waiting times can be short, with a majority of customers served immediately upon arrival, while server utilizations remain close to 100%. However, we show that this confluence of quality and efficiency is not achieved without risk, because there can be severe congestion if the system does not operate as planned. In particular, we show that the large scale makes the system more vulnerable to service interruptions when (i) most customers remain waiting until they can be served, and (ii) when many servers are unable to function during the interruption, as may occur with a system-wide computer failure. Increasing scale leads to higher server utilizations, which in turn leads to longer recovery times from service interruptions and worse performance during such events. We quantify the impact of service interruptions with increasing scale by introducing and analyzing approximating deterministic fluid models. We also show that these fluid models can be obtained from many-server heavy-traffic limits.

大规模服务系统服务中断流体模型重流量极限