🌙

桥接鲁棒主成分分析中的凸优化与非凸优化:噪声、异常值与缺失数据

Bridging convex and nonconvex optimization in robust PCA: Noise, outliers and missing data

Annals of Statistics · 2021
被引 55
ABS 4*

中文导读

本文改进了鲁棒主成分分析中凸规划方法的理论保证,在随机噪声、稀疏异常值和缺失数据并存下,证明其能达到接近最优的统计精度,对处理含异常值的大规模数据有用。

Abstract

This paper delivers improved theoretical guarantees for the convex programming approach in low-rank matrix estimation, in the presence of (1) random noise, (2) gross sparse outliers and (3) missing data. This problem, often dubbed as robust principal component analysis (robust PCA), finds applications in various domains. Despite the wide applicability of convex relaxation, the available statistical support (particularly the stability analysis in the presence of random noise) remains highly suboptimal, which we strengthen in this paper. When the unknown matrix is well conditioned, incoherent and of constant rank, we demonstrate that a principled convex program achieves near-optimal statistical accuracy, in terms of both the Euclidean loss and the ℓ∞ loss. All of this happens even when nearly a constant fraction of observations are corrupted by outliers with arbitrary magnitudes. The key analysis idea lies in bridging the convex program in use and an auxiliary nonconvex optimization algorithm, and hence the title of this paper.

鲁棒主成分分析凸优化低秩矩阵估计异常值检测缺失数据