What do discounted optima converge to?: A theory of discount rate asymptotics in economic models
研究贴现率趋近于零时,折扣最优策略的极限性质,给出其收敛于长期平均最优或追赶最优策略的条件,并引入强长期平均准则,该准则更易使用且具有递归表示。
This paper considers a sequence of stochastic dynamic problems in which the discount rate goes to zero. Sufficient conditions are given under which the limit of discounted optimal policies (and normalized values) are a (i) long-run average optimal and (ii) catching-up optimal policy (and value). These conditions are shown to be satisfied in many economic models. A new decision criterion for the undiscounted problem, the strong long-run average, is introduced. This criterion refines the long-run average, is often equivalent to the catching-up, and is easier to use than the latter since it has a recursive representation. A generalization of Fatou's lemma, which has wider applicability, is also proved.