Machine Labor
通过实例说明机器学习在回归因果推断中的用途,如用lasso选择控制变量估计大学特征对工资的影响,并评估其在工具变量方法中的适用性,发现机器学习在劳动经济学的工具变量应用中效果不佳。
The utility of machine learning (ML) for regression-based causal inference is illustrated by using lasso to select control variables for estimates of college characteristics’ wage effects. Post-double-selection lasso offers a path to data-driven sensitivity analysis. ML also seems useful for an instrumental variables (IV) first stage, since two-stage least squares (2SLS) bias reflects overfitting. While ML-based instrument selection can improve on 2SLS, split-sample IV and limited information maximum likelihood do better. Finally, we use ML to choose IV controls. Here, ML creates artificial exclusion restrictions, generating spurious findings. On balance, ML seems ill-suited to IV applications in labor economics.