基于分位数相关的变量选择

Quantile Correlation-based Variable Selection

Journal of Business & Economic Statistics · 2021
被引 4
人大 AABS 4

中文导读

提出一种基于分位数相关的多重检验和逐步选择方法,用于高维数据中识别重要变量,无需预设模型,计算高效且理论性质良好。

Abstract

This article is concerned with identifying important features in high-dimensional data analysis, especially when there are complex relationships among predictors. Without any specification of an actual model, we first introduce a multiple testing procedure based on the quantile correlation to select important predictors in high dimensionality. The quantile-correlation statistic is able to capture a wide range of dependence. A stepwise procedure is studied for further identifying important variables. Moreover, a sure independent screening based on the quantile correlation is developed in handling ultrahigh dimensional data. It is computationally efficient and easy to implement. We establish the theoretical properties under mild conditions. Numerical studies including simulation studies and real data analysis contain supporting evidence that the proposal performs reasonably well in practical settings.

高维变量选择分位数相关系数多重检验独立筛选