SPURIOUS FACTORS IN DATA WITH LOCAL-TO-UNIT ROOTS
将虚假因子分析扩展到具有异质局部单位根的高维数据,发现主成分分析估计的“因子”反映的是强时间相关性,而非真正的跨截面共性。
This paper extends the spurious factor analysis of Onatski and Wang (2021, Spurious factor analysis. Econometrica , 89(2), 591–614.) to high-dimensional data with heterogeneous local-to-unit roots. We find a spurious factor phenomenon similar to that observed in the data with unit roots. Namely, the “factors” estimated by the principal components analysis converge to principal eigenfunctions of a weighted average of the covariance kernels of the demeaned Ornstein–Uhlenbeck processes with different decay rates. Thus, such “factors” reflect the structure of the strong temporal correlation of the data and do not correspond to any cross-sectional commonalities, that genuine factors are usually associated with. Furthermore, the principal eigenvalues of the sample covariance matrix are very large relative to the other eigenvalues, creating an illusion of the “factors”capturing much of the data’s common variation. We conjecture that the spurious factor phenomenon holds, more generally, for data obtained from high frequency sampling of heterogeneous continuous time (or spacial) processes, and provide an illustration.