Modelling the influence of returns for an omni-channel retailer

European Journal of Operational Research · 2022

Cited by 36

ABS 4

Joost Goedhart (corresponding author)
R. Haijema
Renzo Akkerman

Summary

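The paper studies how an omni-channel retailer should allocate inventory between its online and offline channels while accounting for multi-period, sales-dependent returns. It formulates the decision problem as a Markov Decision Process and solves large-scale instances with deep reinforcement learning.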

Abstract

More brick-and-mortar retailers open an online channel to increase sales. Often, they use the store to fulfil online orders and to receive returned products. The uncertain product returns however complicate the replenishment decision of a retailer. The inventory also has to be rationed over the offline and online sales channels. We therefore integrate the rationing and ordering decisions of an omni-channel retailer in a Markov Decision Process (MDP) that maximises the retailer’s profit. Contrary to previous studies, we explicitly model multi-period sales-dependent returns, which is more realistic and leads to higher profit and service levels. With Value Iteration (VI) an exact solution can only be computed for relatively small-scale instances. For solving large-scale instances, we constructed a Deep Reinforcement Learning (DRL) algorithm. The different methods are compared in an extensive numerical study of small-scale instances to gain insights. The results show that the running time of VI increases exponentially in the problem size, while the running time of DRL is high but scales well. DRL has a low optimality gap but the performance drops when there is a higher level of uncertainty or if the profit trade-off between different actions is minimal. Our approach of modelling multi-period sales-dependent product returns outperforms other methods. Furthermore, based on large-scale instances, we find that increasing online returns lowers the profit and the service level in the offline channel. However, longer return windows do not influence the retailer’s profit.
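
To make the abstract's terms concrete, the sketch below runs Value Iteration on a toy inventory MDP with sales-dependent returns. It is not the paper's model: it has a single sales channel, a one-period return window, and no rationing decision. Every name and number in it (`MAX_INV`, `DEMAND_PMF`, `RETURN_PROB`, the price and cost parameters, the discount factor) is an assumption chosen for illustration. The state is the pair (inventory on hand, units sold in the previous period), which is what makes the number of returns depend on past sales.

```python
from math import comb

# All parameters below are illustrative assumptions, not values from the paper.
MAX_INV = 5                              # storage capacity
DEMAND_PMF = {0: 0.3, 1: 0.4, 2: 0.3}    # demand distribution per period
RETURN_PROB = 0.2                        # chance a unit sold last period comes back
PRICE, COST, HOLD = 5.0, 3.0, 0.1        # selling price, unit order cost, holding cost
GAMMA = 0.95                             # discount factor

MAX_SOLD = max(DEMAND_PMF)
# State: (inventory on hand, units sold in the previous period).
STATES = [(i, s) for i in range(MAX_INV + 1) for s in range(MAX_SOLD + 1)]


def binom_pmf(n, p):
    """Binomial distribution of the number of returns out of n sold units."""
    return {k: comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)}


def transitions(state, order):
    """Yield (probability, reward, next_state) for ordering `order` units."""
    inv, sold_prev = state
    for ret, p_ret in binom_pmf(sold_prev, RETURN_PROB).items():
        # Returned units are restocked; anything above capacity is discarded.
        stock = min(inv + order + ret, MAX_INV)
        for dem, p_dem in DEMAND_PMF.items():
            sales = min(dem, stock)
            left = stock - sales
            # Revenue minus refunds, ordering cost, and holding cost.
            reward = PRICE * sales - PRICE * ret - COST * order - HOLD * left
            yield p_ret * p_dem, reward, (left, sales)


def value_iteration(tol=1e-6):
    """Return the value function and a greedy ordering policy."""
    values = {s: 0.0 for s in STATES}
    while True:
        new_values, policy = {}, {}
        for s in STATES:
            best = None
            for a in range(MAX_INV - s[0] + 1):  # feasible order quantities
                q = sum(p * (r + GAMMA * values[ns])
                        for p, r, ns in transitions(s, a))
                if best is None or q > best:
                    best, policy[s] = q, a
            new_values[s] = best
        delta = max(abs(new_values[s] - values[s]) for s in STATES)
        values = new_values
        if delta < tol:
            return values, policy


if __name__ == "__main__":
    values, policy = value_iteration()
    for s in STATES:
        print(s, "order", policy[s], "value", round(values[s], 2))
```

Even in this toy version the state space is the product of the inventory range and the sales history. Adding a second channel or lengthening the return window multiplies the number of states, which is consistent with the abstract's remark that exact Value Iteration is limited to small instances.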

Operations management · Inventory management · Omni-channel retail · Reinforcement learning
