A Unified View of the IPA, SF, and LR Gradient Estimation Techniques
揭示了似然比梯度估计与无穷小扰动分析之间的内在联系,通过重新定义样本空间将IPA视为LR/SF的特例,并给出无偏估计的充分条件。
We study the links between the likelihood-ratio (LR) gradient-estimation technique (sometimes called the score-function (SF) method), and infinitesimal perturbation analysis (IPA). We show how IPA can be viewed as a (degenerate) special case of the LR and SF techniques by selecting an appropriate representation of the underlying sample space for a given simulation experiment. We also show how different definitions of the sample space yield different variants of the LR method, some of them mixing IPA with more straightforward LR. We illustrate this by many examples. We also give sufficient conditions under which the gradient estimators are unbiased.