情境逆优化：离线与在线学习

Contextual Inverse Optimization: Offline and Online Learning

Operations Research · 2023

被引 12

人大 AFT50UTD24ABS 4*

Omar Besbes · 哥伦比亚大学
Yuri Fonseca · 哥伦比亚大学
Ilan Lobel · 纽约大学

中文导读

研究如何从专家过去的最优决策数据中逆向推断其决策过程，并量化离线与在线两种数据收集方式下可达到的模仿性能。

Abstract

Learning from data are critical across applications. However, in many applications, past data only gives partial information about the future. In “Contextual Inverse Optimization: Offline and Online Learning,” Besbes, Fonseca, and Lobel study a general setting in which historical data are associated with observations of past optimal actions from experts in specific contexts but without the underlying rewards associated with these actions. To what extent can one “reverse engineer” the underlying decision-making process of experts and mimic them? The authors develop results that quantify the performance that is achievable given the data at hand in two types of settings: the offline setting in which data have already been collected and the online setting in which data are collected “on the fly.”

机器学习数据科学运筹学决策科学

阅读原文 ↗