🌙

具有协方差估计的稳健多元套索回归

Robust Multivariate Lasso Regression with Covariance Estimation

Journal of Computational and Graphical Statistics · 2022
被引 8
ABS 3

中文导读

提出一种稳健的多元套索回归方法RMLC,通过Huber或Tukey损失函数抵抗异常值,同时估计回归系数和响应变量的协方差结构,模拟和真实数据分析显示其可靠性。

Abstract

Multivariate regression with covariance estimation (MRCE) is a method that performs sparse estimation of multivariate regression coefficients, while taking account the covariance structure of the response variables. MRCE uses a penalized likelihood approach to simultaneously estimate the regression coefficients and the inverse covariance matrix so that prediction accuracy can be significantly improved. However, traditional likelihood-based methods such as MRCE can produce very misleading results in the presence of outliers. In this work, we propose an extension of MRCE, namely, a robust multivariate lasso regression with covariance estimation (RMLC) to handle potential outliers within the data. By using Huber's loss or Tukey's biweight loss, RMLC can be resistant to outliers in the responses or in both the responses and the covariates. A novel optimization algorithm that incorporates a 2-fold accelerated proximal gradient (APG) algorithm is developed to solve RMLC efficiently. We also demonstrate that our proposed RMLC enjoys the oracle property. Our simulation study shows that RMLC produces very reliable results for both the regression coefficients and the correlation structure of the responses, even if the data are contaminated. A real analysis on hyperspectral data further demonstrates the utility of RMLC. Supplementary materials for this article are available online.

多元回归协方差估计稳健统计异常值处理套索回归