预测贫困

Predicting Poverty

World Bank Economic Review · 2024
被引 4
人大 A-ABS 3

中文导读

通过人为制造不同缺失模式和数据缺失比例,比较经典计量模型与机器学习模型在已知真实反事实贫困率下的预测能力,发现随机森林在多数场景下更稳定准确。

Abstract

Abstract Poverty prediction models are used to address missing data issues in a variety of contexts such as poverty profiling, targeting with proxy-means tests, cross-survey imputations such as poverty mapping, top and bottom income studies, or vulnerability analyses. Based on the models used by this literature, this paper conducts a study by artificially corrupting data clear of missing incomes with different patterns and shares of missing incomes. It then compares the capacity of classic econometric and machine-learning models to predict poverty under different scenarios with full information on observed and unobserved incomes, and the true counterfactual poverty rate. Random forest provides more consistent and accurate predictions under most but not all scenarios.

贫困预测机器学习随机森林缺失数据