Efficient multiple‐robust estimation for nonresponse data under informative sampling
针对概率抽样中的无应答问题,提出一种将抽样权重视为随机变量的统计框架,通过两步经验似然方法实现多重稳健估计,并整合外部汇总统计量提升效率,适用于复杂调查数据分析。
Abstract Nonresponse in probability sampling presents a long‐standing challenge in survey sampling, often necessitating simultaneous adjustments to address sampling and selection biases. We develop a statistical framework that explicitly models sampling weights as random variables and establish the semiparametric efficiency bound for the parameter of interest under nonresponse. This study investigates strategies for eliminating bias and effectively utilizing available information, extending beyond nonresponse issues to data integration with external summary statistics. The proposed estimators are characterized by their efficiency and double robustness. However, realizing full efficiency hinges on the accurate specification of underlying models. To enhance robustness against potential model misspecification, we expand double robustness to multiple robustness through a novel two‐step empirical likelihood approach. A numerical study evaluates the finite‐sample performance of our methods. Additionally, we apply these methods to a dataset from the National Health and Nutrition Examination Survey, effectively integrating summary statistics from the National Health Interview Survey.