🌙

高维动态主成分分析

Dynamic Principal Component Analysis in High Dimensions

Journal of the American Statistical Association · 2022
被引 14
ABS 4

中文导读

针对高维数据中协方差矩阵的动态变化,提出一种结合局部线性平滑和正则化惩罚的优化框架,直接估计动态特征向量,适用于常见和不规则设计的高维数据。

Abstract

Principal component analysis is a versatile tool to reduce dimensionality which has wide applications in statistics and machine learning. It is particularly useful for modeling data in high-dimensional scenarios where the number of variables p is comparable to, or much larger than the sample size n. Despite an extensive literature on this topic, researchers have focused on modeling static principal eigenvectors, which are not suitable for stochastic processes that are dynamic in nature. To characterize the change in the entire course of high-dimensional data collection, we propose a unified framework to directly estimate dynamic eigenvectors of covariance matrices. Specifically, we formulate an optimization problem by combining the local linear smoothing and regularization penalty together with the orthogonality constraint, which can be effectively solved by manifold optimization algorithms. We show that our method is suitable for high-dimensional data observed under both common and irregular designs, and theoretical properties of the estimators are investigated under lq(0≤q≤1) sparsity. Extensive experiments demonstrate the effectiveness of the proposed method in both simulated and real data examples.

统计学机器学习高维数据分析降维