Reward shaping to improve the performance of deep reinforcement learning in perishable inventory management

European Journal of Operational Research · 2021
Cited by 109 · Top 5% of articles in the same journal and year
ABS 4
Inventory management · Reinforcement learning · Operations management · Artificial intelligence · Operations research