噪声与昂贵密度函数的蒙特卡洛方法综述:在强化学习和近似贝叶斯计算中的应用

A Survey of Monte Carlo Methods for Noisy and Costly Densities With Application to Reinforcement Learning and ABC

International Statistical Review · 2024
被引 6 · 同刊同年前 8%
ABS 3

中文导读

综述了使用替代模型处理难以计算、昂贵或带噪声的密度函数的蒙特卡洛方法,分类描述了三种主要方法及其在强化学习和似然无关推断中的应用。

Abstract

Summary This survey gives an overview of Monte Carlo methodologies using surrogate models, for dealing with densities that are intractable, costly, and/or noisy. This type of problem can be found in numerous real‐world scenarios, including stochastic optimisation and reinforcement learning, where each evaluation of a density function may incur some computationally‐expensive or even physical (real‐world activity) cost, likely to give different results each time. The surrogate model does not incur this cost, but there are important trade‐offs and considerations involved in the choice and design of such methodologies. We classify the different methodologies into three main classes and describe specific instances of algorithms under a unified notation. A modular scheme that encompasses the considered methods is also presented. A range of application scenarios is discussed, with special attention to the likelihood‐free setting and reinforcement learning. Several numerical comparisons are also provided.

蒙特卡洛方法强化学习近似贝叶斯计算机器学习统计学