学会何时停止搜索

Learning When to Stop Searching

Management Science · 2019

被引 42

人大 A+FT50UTD24ABS 4*

Daniel G. Goldstein · 微软（美国）
R. Preston McAfee · 微软（美国）
Siddharth Suri · 微软（美国）
James R. Wright · 阿尔伯塔大学

中文导读

通过重复秘书问题实验，发现人们在重复搜索中能学习并接近最优停止策略，贝叶斯模型比较表明阈值模型最佳。

Abstract

In the classical secretary problem, one attempts to find the maximum of an unknown and unlearnable distribution through sequential search. In many real-world searches, however, distributions are not entirely unknown and can be learned through experience. To investigate learning in such settings, we conduct a large-scale behavioral experiment in which people search repeatedly from fixed distributions in a “repeated secretary problem.” In contrast to prior investigations that find no evidence for learning in the classical scenario, in the repeated setting we observe substantial learning resulting in near-optimal stopping behavior. We conduct a Bayesian comparison of multiple behavioral models, which shows that participants’ behavior is best described by a class of threshold-based models that contains the theoretically optimal strategy. Fitting such a threshold-based model to data reveals players’ estimated thresholds to be close to the optimal thresholds after only a small number of games. This paper was accepted by Yuval Rottenstreich, judgment and decision making.

最优停止理论重复秘书问题阈值模型行为学习

阅读原文 ↗