具有马尔可夫生产关系的生产规划

Production Planning with Markovian Production Relationships

Production and Operations Management · 2026
被引 0 · 同刊同年前 6%
人大 AFT50UTD24ABS 4

中文导读

研究了生产计划与实际产量间存在线性、非线性、确定或随机关系时的生产规划问题,提出基于深度强化学习的求解框架,在非线性情形下最优性差距约10%-20%。

Abstract

We study a production planning problem with linear, nonlinear, deterministic, and/or stochastic production relationships between the production plans and actual production quantities. We start by introducing a stochastic dynamic programing formulation of the problem with a Markovian assumption on the production relationships. Under specific conditions, we establish the convexity of the optimal cost-to-go function and closed forms of optimal policies. To solve the original problem in the general case, we propose a solution framework based on sequential policy optimization and deep reinforcement learning. We discuss the theoretical properties of the framework and evaluate its numerical performance with linear and nonlinear production relationships. In the linear case, our framework performs in line with the state-of-the-art optimization-based methods with improved computational efficiency. In the nonlinear case, our framework achieves an optimality gap near 10%–20%. We also illustrate that the proposed methodology can also be extended to the problem of joint production planning and scheduling.

生产规划随机规划马尔可夫过程深度强化学习