Impact of Server Capability on Pooling Configuration in Stochastic Service Systems
研究了在随机服务系统中,服务器能力水平、工作负载和需求不对称性如何影响资源池化配置,发现部分灵活系统在低服务器能力且不对称场景下优于完全灵活系统。
Resource pooling is often introduced in service systems to cope with the variability in customer demand. The primary motivation behind creating a more flexible system is to utilize resources efficiently—namely, assigning customers whose dedicated resources are fully occupied to available non-dedicated resources (referred to as off-service placement). In such a setup, the service system manager expects to serve more customers within a fixed timeframe. However, recent empirical evidence shows that, due to limited server capability, the service time at non-dedicated providers can be significantly longer than that at dedicated ones. In this study, we develop a two-server stochastic model to examine how server capability levels and other factors—such as overall workload and demand asymmetry—affect pooling configurations. We derive conditions that specify the optimal system flexibility configuration across these parameters. Our findings reveal that a partially flexible system can outperform a fully flexible one, particularly in asymmetric scenarios with low server capability. This advantage is also pronounced when considering a range of system costs, including server capability, cross-serving, and other related costs in stochastic service systems with and without buffers. These insights from our two-server model offer guidance on designing more efficient flexibility in complex multi-class, multi-server service systems.