自相关误差回归的频域选择准则

A Frequency Domain Selection Criterion for Regression With Autocorrelated Errors

Journal of the American Statistical Association · 1990
被引 5
ABS 4

中文导读

提出一种基于频域交叉验证的数据驱动方法,用于在误差自相关结构未知时选择回归函数的参数或非参数估计量,适用于瞬态信号或增长衰减曲线。

Abstract

Abstract We consider the regression model yt = η t + ε t , t = 0, 1, …, n − 1, where y t are scalar observations, η t is the unknown regression function, and ε t are unobservable errors generated by a zero-mean weakly stationary process, independent of η t and with completely unknown autocorrelation structure. We propose a data-driven method for selecting a parametric or nonparametric estimator of η t . The method is based on cross-validation in the frequency domain and requires no assumptions about the form of the estimator or the error correlations. It does, however, require the discrete Fourier transform (DFT) of the signal η t to be a smooth complex function of frequency, as is the case, for example, with transient signals or growth and decay curves. After giving some general motivations for the method, we focus on the special case of linear estimators of a nonparametric regression function, including both kernel and spline estimators. For these estimators, we develop efficient methods of evaluating the frequency domain cross-validation (FDCV) function. The standard time domain cross-validation (TDCV) method, which leaves out data points one at a time, is sensible only when the errors are independent. Autocorrelation among the errors can cause severe biases in the TDCV function, leading to poor selections. FDCV leaves out discrete Fourier transform values one at a time. These values are approximately independent regardless of the error correlation structure, and hence FDCV remains valid even for correlated errors, as long as the DFT of η t at the omitted frequency can be predicted from those remaining. Asymptotic properties of FDCV are given for a class of transient signals. Then the usefulness of FDCV for transient and other signals is demonstrated in a Monte Carlo study comparing the performances of TDCV and FDCV for selecting a kernel estimator of a nonparametric regression function. The use of FDCV is illustrated with data on international airline travel.

非参数回归自相关误差频域交叉验证核估计样条估计