面向高效纯探索的双导向算法设计

Dual-Directed Algorithm Design for Efficient Pure Exploration

Operations Research · 2025

被引 0

人大 AFT50UTD24ABS 4*

Wei You · 香港科技大学
Chao Qin · 斯坦福大学

中文导读

提出统一的双导向框架，将top-two方法从最优臂识别扩展到一般纯探索问题，理论证明渐近最优性，实验显示显著优于现有方法。

Abstract

Although experimental design often focuses on selecting the single best alternative from a finite set, many pure-exploration problems pursue richer goals. Given a specific goal, adaptive experimentation aims to achieve it by strategically allocating sampling effort, with the underlying sample complexity characterized by a maximin optimization problem. In "Dual-Directed Algorithm Design for Efficient Pure Exploration," Qin and You introduce a unified dual-directed framework for efficiently solving general pure-exploration problems, yielding a unified algorithm design principle that extends the top-two approach beyond best-arm identification. Their theoretical analysis proves asymptotic optimality for classical problems, such as Gaussian best-arm identification, thresholding bandits, and epsilon-best-arm identification. Extensive numerical experiments confirm these theoretical insights, showcasing significant improvements over existing methods. This dual-directed framework offers researchers and practitioners a powerful and versatile tool to navigate uncertainty and optimize exploration strategies effectively.

计算机科学数学优化自适应实验纯探索问题

阅读原文 ↗