虚假因子分析

Spurious Factor Analysis

Econometrica · 2021
被引 28
人大 A+FT50ABS 4*

中文导读

揭示了在高维非平稳数据中,主成分分析会错误地提取出少数虚假因子,并建议实证研究者始终对比水平与差分分析以避免误判。

Abstract

This paper draws parallels between the principal components analysis of factorless high‐dimensional nonstationary data and the classical spurious regression. We show that a few of the principal components of such data absorb nearly all the data variation. The corresponding scree plot suggests that the data contain a few factors, which is corroborated by the standard panel information criteria. Furthermore, the Dickey–Fuller tests of the unit root hypothesis applied to the estimated “idiosyncratic terms” often reject, creating an impression that a few factors are responsible for most of the nonstationarity in the data. We warn empirical researchers of these peculiar effects and suggest to always compare the analysis in levels with that in differences.

虚假因子分析主成分分析非平稳数据伪回归