🌙

完全不相关的稀疏主成分分析

Exactly Uncorrelated Sparse Principal Component Analysis

Journal of Computational and Graphical Statistics · 2023
被引 3
ABS 3

中文导读

提出一种名为EUSPCA的稀疏主成分分析方法,能产生完全不相关的主成分,同时保持与经典PCA相当的保真度,解决了现有方法产生相关成分的问题。

Abstract

Sparse principal component analysis (PCA) aims to find principal components as linear combinations of a subset of the original input variables without sacrificing the fidelity of the classical PCA. Most existing sparse PCA methods produce correlated sparse principal components. We argue that many applications of PCA prefer uncorrelated principal components. However, handling sparsity and uncorrelatedness properties in a sparse PCA method is nontrivial. This article proposes an exactly uncorrelated sparse PCA method named EUSPCA, whose formulation is motivated by original views and motivations of PCA as advocated by Pearson and Hotelling. EUSPCA is a non-smooth constrained non-convex manifold optimization problem. We solve it by combining augmented Lagrangian and non-monotone proximal gradient methods. We observe that EUSPCA produces uncorrelated components and maintains a similar or better level of fidelity based on adjusted total variance through simulated and real data examples. In contrast, existing sparse PCA methods produce significantly correlated components. Supplemental materials for this article are available online.

主成分分析稀疏PCA优化算法机器学习统计学