🌙

因子增强稀疏穿透深度ReLU神经网络用于高维回归

Factor Augmented Sparse Throughput Deep ReLU Neural Networks for High Dimensional Regression

Journal of the American Statistical Association · 2023
被引 22 · 同刊同年前 8%
ABS 4

中文导读

提出因子增强稀疏穿透(FAST)模型,结合潜在因子和稀疏特质成分进行非参数回归,适用于强依赖和弱依赖协变量,并给出基于深度ReLU网络的估计量及其理论性质。

Abstract

This article introduces a Factor Augmented Sparse Throughput (FAST) model that uses both latent factors and sparse idiosyncratic components for nonparametric regression. It contains many popular statistical models. The FAST model bridges factor models on one end and sparse nonparametric models on the other end. It encompasses structured nonparametric models such as factor augmented additive models and sparse low-dimensional nonparametric interaction models and covers the cases where the covariates do not admit factor structures. This model allows us to conduct high-dimensional nonparametric model selection for both strong dependent and weak dependent covariates and hence contributes to interpretable machine learning, particularly to the feature selections for neural networks. Via diversified projections as estimation of latent factor space, we employ truncated deep ReLU networks to nonparametric factor regression without regularization and to a more general FAST model using nonconvex regularization, resulting in factor augmented regression using neural network (FAR-NN) and FAST-NN estimators, respectively. We show that FAR-NN and FAST-NN estimators adapt to the unknown low-dimensional structure using hierarchical composition models in nonasymptotic minimax rates. We also study statistical learning for the factor augmented sparse additive model using a more specific neural network architecture. Our results are applicable to the weak dependent cases without factor structures. In proving the main technical result for FAST-NN, we establish a new deep ReLU network approximation result that contributes to the foundation of neural network theory. Numerical studies further support our theory and methods. Supplementary materials for this article are available online.

非参数统计高维回归因子模型神经网络机器学习