良性过拟合与含噪特征

Benign Overfitting and Noisy Features

Journal of the American Statistical Association · 2022

被引 13

ABS 4

Zhu Li 通讯
Weijie Su
Dino Sejdinović

中文导读

研究了随机特征模型中良性过拟合现象的条件，发现特征中的噪声起到隐式正则化作用，并首次证明过参数化模型能达到极小化最优学习率。

Abstract

Modern machine learning models often exhibit the benign overfitting phenomenon, which has recently been characterized using the double descent curves. In addition to the classical U-shaped learning curve, the learning risk undergoes another descent as we increase the number of parameters beyond a certain threshold. In this article, we examine the conditions under which benign overfitting occurs in the random feature (RF) models, that is, in a two-layer neural network with fixed first layer weights. Adopting a novel view of random features, we show that benign overfitting emerges because of the noise residing in such features. The noise may already exist in the data and propagates to the features, or it may be added by the user to the features directly. Such noise plays an implicit yet crucial regularization role in the phenomenon. In addition, we derive the explicit tradeoff between the number of parameters and the prediction accuracy, and for the first time demonstrate that overparameterized model can achieve the optimal learning rate in the minimax sense. Finally, our results indicate that the learning risk for overparameterized models has multiple, instead of double descent behavior, which is empirically verified in recent works. Supplementary materials for this article are available online.

机器学习过拟合随机特征模型正则化

阅读原文 ↗