Dual-Directed Algorithm Design for Efficient Pure Exploration
提出统一的双导向框架,将top-two方法从最优臂识别扩展到一般纯探索问题,理论证明渐近最优性,实验显示显著优于现有方法。
Although experimental design often focuses on selecting the single best alternative from a finite set, many pure-exploration problems pursue richer goals. Given a specific goal, adaptive experimentation aims to achieve it by strategically allocating sampling effort, with the underlying sample complexity characterized by a maximin optimization problem. In "Dual-Directed Algorithm Design for Efficient Pure Exploration," Qin and You introduce a unified dual-directed framework for efficiently solving general pure-exploration problems, yielding a unified algorithm design principle that extends the top-two approach beyond best-arm identification. Their theoretical analysis proves asymptotic optimality for classical problems, such as Gaussian best-arm identification, thresholding bandits, and epsilon-best-arm identification. Extensive numerical experiments confirm these theoretical insights, showcasing significant improvements over existing methods. This dual-directed framework offers researchers and practitioners a powerful and versatile tool to navigate uncertainty and optimize exploration strategies effectively.