🌙

用于估计总体规模的双重稳健捕获-再捕获方法

Doubly Robust Capture-Recapture Methods for Estimating Population Size

Journal of the American Statistical Association · 2023
被引 3
ABS 4

中文导读

提出一种双重稳健的捕获-再捕获估计量,在非参数条件下接近最优,并给出总体规模的置信区间构建方法,应用于秘鲁武装冲突期间死亡和失踪人数的估计。

Abstract

Estimation of population size using incomplete lists has a long history across many biological and social sciences. For example, human rights groups often construct partial lists of victims of armed conflicts, to estimate the total number of victims. Earlier statistical methods for this setup often use parametric assumptions, or rely on suboptimal plug-in-type nonparametric estimators; but both approaches can lead to substantial bias, the former via model misspecification and the latter via smoothing. Under an identifying assumption that two lists are conditionally independent given measured covariates, we make several contributions. First, we derive the nonparametric efficiency bound for estimating the capture probability, which indicates the best possible performance of any estimator, and sheds light on the statistical limits of capture-recapture methods. Then we present a new estimator, that has a double robustness property new to capture-recapture, and is near-optimal in a nonasymptotic sense, under relatively mild nonparametric conditions. Next, we give a confidence interval construction method for total population size from generic capture probability estimators, and prove nonasymptotic near-validity. Finally, we apply them to estimate the number of killings and disappearances in Peru during its internal armed conflict between 1980 and 2000. Supplementary materials for this article are available online.

统计学人口估计非参数统计生物统计学社会科学