高维条件变量可加分离模型中的推断

Inference in Additively Separable Models With a High-Dimensional Set of Conditioning Variables

Journal of Business & Economic Statistics · 2020
被引 11
人大 AABS 4

中文导读

研究在条件变量z可能高维时,如何用非参数级数估计和推断单个变量x对结果y的效应,提出后非参数双重选择方法,并证明其渐近正态性。

Abstract

This paper studies nonparametric series estimation and inference for the effect of a single variable of interest x on an outcome y in the presence of potentially high-dimensional conditioning variables z. The context is an additively separable model E[y|x, z] = g0(x) + h0(z). The model is high-dimensional in the sense that the series of approximating functions for h0(z) can have more terms than the sample size, thereby allowing z to have potentially very many measured characteristics. The model is required to be approximately sparse: h0(z) can be approximated using only a small subset of series terms whose identities are unknown. This paper proposes an estimation and inference method for g0(x) called Post-Nonparametric Double Selection which is a generalization of Post-Double Selection. Standard rates of convergence and asymptotic normality for the estimator are shown to hold uniformly over a large class of sparse data generating processes. A simulation study illustrates finite sample estimation properties of the proposed estimator and coverage properties of the corresponding confidence intervals. Finally, an empirical application estimating convergence in GDP in a country-level crosssection demonstrates the practical implementation of the proposed method.

非参数级数估计高维控制变量可加分离模型后非参数双重选择