High‐Dimensional Inference for Single‐Index Models With Latent Factors
针对含潜在因子的单指标模型,提出无需估计高维系数的检验统计量和去偏估计量,适用于误差重尾或异常值场景,并通过基因数据验证了有效性。
ABSTRACT Models with latent factors have recently attracted considerable attention. However, most existing studies focus on linear regression models and therefore fail to capture potential nonlinear structures. To address this limitation, we consider the factor augmented single‐index model. We first examine whether the inclusion of the augmented component is necessary by introducing a score‐type test statistic. Unlike existing test statistics, the proposed one does not require estimating high‐dimensional regression coefficients or precision matrices, making it computationally simpler and more stable. To determine the critical value, we employ a Gaussian multiplier bootstrap, whose theoretical validity is established under mild regularity conditions. We further investigate the penalized estimation of the regression model. With estimated latent factors, we establish the error bounds of the estimators. In addition, we construct confidence intervals for individual coefficients based on a debiased estimator. Importantly, our method does not impose moment conditions on the error distribution, allowing it to perform well even when the errors are heavy‐tailed or contaminated by outliers. Comprehensive simulation studies and an application to a gene expression dataset demonstrate the effectiveness and robustness of the proposed procedure.