基于Lasso的推断方法中的遗漏变量偏差:有限样本分析

Omitted Variable Bias of Lasso-Based Inference Methods: A Finite Sample Analysis

Review of Economics and Statistics · 2021
被引 1
人大 AFT50ABS 4

中文导读

研究了基于Lasso的推断方法(如post–double Lasso和debiased Lasso)在有限样本中的行为,发现即使系数稀疏且样本量大于控制变量数,这些方法仍可能因Lasso未选择相关控制变量而产生显著遗漏变量偏差,对实证应用中的渐近理论使用提出警示。

Abstract

Abstract We study the finite sample behavior of Lasso-based inference methods such as post–double Lasso and debiased Lasso. We show that these methods can exhibit substantial omitted variable biases (OVBs) due to Lasso's not selecting relevant controls. This phenomenon can occur even when the coefficients are sparse and the sample size is large and larger than the number of controls. Therefore, relying on the existing asymptotic inference theory can be problematic in empirical applications. We compare the Lasso-based inference methods to modern high-dimensional OLS-based methods and provide practical guidance.

Lasso遗漏变量偏误有限样本高维推断