大规模排序与选择中贪婪过程的（令人惊讶的）样本最优性

The (Surprising) Sample Optimality of Greedy Procedures for Large-Scale Ranking and Selection

Management Science · 2024

被引 9

人大 A+FT50UTD24ABS 4*

Zaile Li · 复旦大学
W.C. Fan · 同济大学
L. Jeff Hong · 复旦大学

中文导读

研究发现，在大规模排序与选择问题中，简单的贪婪过程（始终采样当前均值最大的备选方案）具有样本最优性，即所需样本量随备选方案数量线性增长，并通过理论证明和数值实验验证了其有效性。

Abstract

Ranking and selection (R&S) aims to select the best alternative with the largest mean performance from a finite set of alternatives. Recently, considerable attention has turned toward the large-scale R&S problem which involves a large number of alternatives. Ideal large-scale R&S procedures should be sample optimal; that is, the total sample size required to deliver an asymptotically nonzero probability of correct selection (PCS) grows at the minimal order (linear order) in the number of alternatives, k. Surprisingly, we discover that the naïve greedy procedure, which keeps sampling the alternative with the largest running average, performs strikingly well and appears sample optimal. To understand this discovery, we develop a new boundary-crossing perspective and prove that the greedy procedure is sample optimal for the scenarios where the best mean maintains at least a positive constant away from all other means as k increases. We further show that the derived PCS lower bound is asymptotically tight for the slippage configuration of means with a common variance. For other scenarios, we consider the probability of good selection and find that the result depends on the growth behavior of the number of good alternatives: if it remains bounded as k increases, the sample optimality still holds; otherwise, the result may change. Moreover, we propose the explore-first greedy procedures by adding an exploration phase to the greedy procedure. The procedures are proven to be sample optimal and consistent under the same assumptions. Last, we numerically investigate the performance of our greedy procedures in solving large-scale R&S problems. This paper was accepted by Baris Ata, stochastic models and simulation. Funding: This work was supported by the National Natural Science Foundation of China [Grants 72091211, 72071146, 72161160340]. Supplemental Material: The e-companion and data files are available at https://doi.org/10.1287/mnsc.2023.00694 .

贪婪过程样本最优性大规模排序与选择正确选择概率

阅读原文 ↗