NONPARAMETRIC ESTIMATION OF REGRESSION FUNCTIONS WITH DISCRETE REGRESSORS
研究了仅含分类变量的非参数回归模型,通过最小二乘交叉验证选择平滑参数,证明了相关解释变量的平滑参数收敛于零,无关变量的平滑参数收敛于上界,从而自动剔除无关变量。
We consider the problem of estimating a nonparametric regression model containing categorical regressors only. We investigate the theoretical properties of least squares cross-validated smoothing parameter selection, establish the rate of convergence (to zero) of the smoothing parameters for relevant regressors, and show that there is a high probability that the smoothing parameters for irrelevant regressors converge to their upper bound values, thereby automatically smoothing out the irrelevant regressors. A small-scale simulation study shows that the proposed cross-validation-based estimator performs well in finite-sample settings.