Cost-Effective Acquisition of First-Party Data for Business Analytics
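This study examines how auction methods, such as the generalized second price auction, can acquire high-quality first-party customer data at lower cost, and builds optimization models that maximize data quality under budget constraints. Experiments show that the approach increases response rates, reduces selection bias, and improves prediction accuracy.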
Customer data acquisition is an important task in data-driven business analytics. Interest has recently grown in the effective use of an organization’s internal customer data, also known as first-party data. This work studies the acquisition of new data for business analytics based on first-party data resources. We address issues related to both acquisition cost and data quality. To reduce acquisition cost, we consider auction-based methods, such as the generalized second price (GSP) auction, which acquire data at different prices for different customers. We find that the GSP-based data acquisition method incurs a lower cost and/or achieves a higher response rate than fixed-price methods. To maximize data quality, we propose novel optimization models for different data acquisition methods and data quality measures. The proposed models maximize the quality of the acquired data while satisfying budget constraints. We derive and discuss the solutions to the optimization models analytically and draw managerial insights from them. The proposed approach is effective in increasing customer responses, reducing selection bias, and enabling more accurate estimation and prediction for business analytics. The experimental evaluation demonstrates the advantage of the proposed approach over existing data acquisition methods.
History: Accepted by Ram Ramesh, Area Editor for Data Science and Machine Learning.
Supplemental Material: The software that supports the findings of this study is available within the paper and its Supplemental Information (https://pubsonline.informs.org/doi/suppl/10.1287/ijoc.2022.0037) as well as from the IJOC GitHub software repository (https://github.com/INFORMSJoC/2022.0037). The complete IJOC Software and Data Repository is available at https://informsjoc.github.io/.
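To make the idea of auction-based acquisition with differential prices concrete, the sketch below shows one generic way a second-price rule could be applied in a procurement (reverse) setting. The abstract does not specify how the paper adapts GSP, so everything here is an assumption for illustration: the function name `reverse_gsp_acquire`, the premise that each customer states an asking price for sharing data, the rule that each winner is paid the next-lowest asking price, and the greedy budget cutoff. It is not the authors' mechanism or their optimization model.

```python
from typing import List, Tuple


def reverse_gsp_acquire(asks: List[float], budget: float) -> List[Tuple[int, float]]:
    """Hypothetical reverse second-price rule (illustrative assumption only).

    asks[i] is the price customer i asks for sharing data.
    Returns (customer_index, payment) pairs for the customers who are paid.
    """
    # Rank customers from cheapest to most expensive asking price.
    order = sorted(range(len(asks)), key=lambda i: asks[i])
    winners: List[Tuple[int, float]] = []
    spent = 0.0
    for rank, customer in enumerate(order):
        # The last-ranked customer has no next ask to set a payment, so stop.
        if rank + 1 >= len(order):
            break
        # Second-price idea: pay the next-lowest ask, not the customer's own ask.
        payment = asks[order[rank + 1]]
        # Stop as soon as the next payment would exceed the budget.
        if spent + payment > budget:
            break
        winners.append((customer, payment))
        spent += payment
    return winners


if __name__ == "__main__":
    # Four customers with made-up asking prices and a budget of 6.
    print(reverse_gsp_acquire([1.0, 4.0, 2.0, 3.0], budget=6.0))
```

In this toy run, customers pay-ranked first and second are acquired at different prices, which is the "differential prices" property the abstract refers to. The sketch says nothing about data quality; the paper's optimization models, which choose whom to acquire so as to maximize quality under the budget, are not reproduced here.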