理解多服务器服务系统的效率

Understanding the Efficiency of Multi-Server Service Systems

Management Science · 1992
被引 185
人大 A+FT50UTD24ABS 4*

中文导读

研究多服务器排队系统中服务器利用率与服务器数量的关系,解释规模经济效应,并给出近似公式计算平均等待时间和等待时间分布,帮助设计高效服务系统。

Abstract

In the design and operation of service systems, it is important to determine an appropriate level of server utilization (the proportion of time each server should be working). In a multi-server queue with unlimited waiting space, the appropriate server utilization typically increases as the number of servers (and the arrival rate) increases. We explain this economy of scale and give a rough quantitative characterization. We also show how increased variability in the arrival and service processes tends to reduce server utilization with a given grade of service. As part of this analysis, we develop simple approximations for the mean steady-state waiting time and the full steady-state waiting-time distribution. These approximations exploit an infinite-server approximation for the probability of delay and a single-server approximation for the conditional waiting-time distribution given that waiting occurs. The emphasis is on simple formulas that directly convey understanding.

多服务器排队系统服务器利用率规模经济等待时间近似