利用随机化打破维度诅咒

Using Randomization to Break the Curse of Dimensionality

Econometrica · 1997
被引 408
人大 A+FT50ABS 4*

中文导读

为有限和无限期界的马尔可夫决策问题引入随机版本的逐次逼近法和多重网格算法,并证明这些算法能打破一类离散决策过程的维度诅咒。

Abstract

This paper introduces random versions of successive approximations and multigrid algorithms for computing approximate solutions to a class of finite and infinite horizon Markovian decision problems (MDPs). We prove that these algorithms succeed in breaking the curse of dimensionality for a subclass of MDPs known as discrete decision processes (DDPs).

随机化算法维数诅咒马尔可夫决策过程离散决策过程