基于检验的前向模型选择分析

Analysis of Testing‐Based Forward Model Selection

Econometrica · 2020
被引 1
人大 A+FT50ABS 4*

中文导读

分析了线性回归中基于统计检验的前向模型选择方法,证明了预测误差和所选协变量数量的概率界,并在异方差数据下与Lasso等估计量比较了收敛速度。

Abstract

This paper analyzes a procedure called Testing‐Based Forward Model Selection (TBFMS) in linear regression problems. This procedure inductively selects covariates that add predictive power into a working statistical model before estimating a final regression. The criterion for deciding which covariate to include next and when to stop including covariates is derived from a profile of traditional statistical hypothesis tests. This paper proves probabilistic bounds, which depend on the quality of the tests, for prediction error and the number of selected covariates. As an example, the bounds are then specialized to a case with heteroscedastic data, with tests constructed with the help of Huber–Eicker–White standard errors. Under the assumed regularity conditions, these tests lead to estimation convergence rates matching other common high‐dimensional estimators including Lasso.

线性回归协变量选择预测误差界