离散事件系统瞬态与稳态行为的优化

Optimization of the Transient and Steady-State Behavior of Discrete Event Systems

Management Science · 1996
被引 25
人大 A+FT50UTD24ABS 4*

中文导读

提出一个通用框架,通过将离散事件系统建模为一般状态空间马尔可夫链,利用仿真和似然比方法评估性能指标及其梯度,并保证Robbins-Monro随机逼近算法收敛到最优参数,适用于瞬态和稳态性能度量。

Abstract

We present a general framework for applying simulation to optimize the behavior of discrete event systems. Our approach involves modeling the discrete event system under study as a general state space Markov chain whose distribution depends on the decision parameters. We then show how simulation and the likelihood ratio method can be used to evaluate the performance measure of interest and its gradient, and we present conditions that guarantee that the Robbins-Monro stochastic approximation algorithm will converge almost surely to the optimal values of the decision parameters. Both transient and steady-state performance measures are considered. For steady-state performance measures, we consider both the case when the Markov chain of interest is regenerative in the standard sense, as well as the case when this Markov chain is Harris recurrent, and thereby regenerative in a wider sense.

离散事件系统仿真优化似然比方法随机逼近