A Comparative Empirical Study of Discrete Choice Models in Retail Operations
系统比较了多种离散选择模型(如MNL、混合Logit、马尔可夫链模型)和估计算法(如列生成、EM算法),通过合成、半合成和真实数据实验,评估了预测能力和收入表现,并给出了不同运营环境下的模型选择建议。
Choice-based demand estimation is a fundamental task in retail operations and revenue management, providing necessary input data for inventory control, assortment, and price-optimization models. The task is particularly difficult in operational contexts where product availability varies over time and customers may substitute into the available options. In addition to the classical multinomial logit (MNL) model and extensions (e.g., nested logit, mixed logit, and latent-class MNL), new demand models have been proposed (e.g., the Markov chain model), and others have been recently revisited (e.g., the rank list-based and exponomial models). At the same time, new computational approaches were developed to ease the estimation function (e.g., column-generation and expectation-maximization (EM) algorithms). In this paper, we conduct a systematic, empirical study of different choice-based demand models and estimation algorithms, including both maximum-likelihood and least-squares criteria. Through an exhaustive set of numerical experiments on synthetic, semisynthetic, and real data, we provide comparative statistics of the predictive power and derived revenue performance of an ample collection of choice models and characterize operational environments suitable for different model/estimation implementations. We also provide a survey of all the discrete choice models evaluated and share all our estimation codes and data sets as part of the online appendix. This paper was accepted by Vishal Gaur, operations management.