当预测变量无/有测量误差时聚类数据的非参数函数估计

Nonparametric Function Estimation for Clustered Data When the Predictor is Measured without/with Error

Journal of the American Statistical Association · 2000
被引 72
ABS 4

中文导读

研究了聚类数据中局部多项式核回归的估计方法,发现忽略组内相关直接合并数据通常最优,但若预测变量有测量误差,加权平均估计量方差更小。

Abstract

We consider local polynomial kernel regression with a single covariate for clustered data using estimating equations. We assume that at most m < ∞ observations are available on each cluster. In the case of random regressors, with no measurement error in the predictor, we show that it is generally the best strategy to ignore entirely the correlation structure within each cluster and instead pretend that all observations are independent. In the further special case of longitudinal data on individuals with fixed common observation times, we show that equivalent to the pooled data approach is the strategy of fitting separate nonparametric regressions at each observation time and constructing an optimal weighted average. We also consider what happens when the predictor is measured with error. Using the SIMEX approach to correct for measurement error, we construct an asymptotic theory for both the pooled and the weighted average estimators. Surprisingly, for the same amount of smoothing, the weighted average estimators typically have smaller variances than the pooling strategy. We apply the proposed methods to analysis of the AIDS Costs and Services Utilization Survey.

非参数统计聚类数据测量误差局部多项式回归计量经济学