Cross-Validation of Multivariate Densities
本文推导并比较了多元核密度估计的三种交叉验证方法,发现其理论性能随维度增加而改善,并通过心脏病和臭氧数据的双变量实例展示了新方法的良好表现。
Abstract In recent years, the focus of study in smoothing parameter selection for kernel density estimation has been on the univariate case, while multivariate kernel density estimation has been largely neglected. In part, this may be due to the perception that calibrating multivariate densities is substantially more difficult. In this article, we explicitly derive and compare multivariate versions of the bootstrap method of Taylor, the least-squares cross-validation method developed by Bowman and Rudemo, and a biased cross-validation method similar to that of Scott and Terrell for multivariate kernel estimation using the product kernel estimator. The theoretical behavior of these cross-validation algorithms is shown to improve (surprisingly) as the dimension increases, approaching the best rate of O(n −1/2). Simulation studies suggest that the new biased cross-validation method performs quite well and with reasonable variability as compared to the other two methods. Bivariate examples with heart disease and ozone data are given to illustrate the behavior of these algorithms.