IPA、SF和LR梯度估计技术的统一视角

A Unified View of the IPA, SF, and LR Gradient Estimation Techniques

Management Science · 1990

被引 225

人大 A+FT50UTD24ABS 4*

Pierre L’Ecuyer · 拉瓦尔大学通讯

中文导读

揭示了似然比梯度估计与无穷小扰动分析之间的内在联系，通过重新定义样本空间将IPA视为LR/SF的特例，并给出无偏估计的充分条件。

Abstract

We study the links between the likelihood-ratio (LR) gradient-estimation technique (sometimes called the score-function (SF) method), and infinitesimal perturbation analysis (IPA). We show how IPA can be viewed as a (degenerate) special case of the LR and SF techniques by selecting an appropriate representation of the underlying sample space for a given simulation experiment. We also show how different definitions of the sample space yield different variants of the LR method, some of them mixing IPA with more straightforward LR. We illustrate this by many examples. We also give sufficient conditions under which the gradient estimators are unbiased.

梯度估计似然比方法无穷小扰动分析得分函数

阅读原文 ↗