本福特定律作为调查可靠性的指标:我们能相信自己的数据吗?

BENFORD'S LAW AS AN INDICATOR OF SURVEY RELIABILITY—CAN WE TRUST OUR DATA?

Journal of Economic Surveys · 2019
被引 32
人大 AABS 2

中文导读

用本福特定律检验六个常用调查数据集的收入变量,发现多数数据与定律偏差较大,表明存在可靠性问题,并建议研究者优先使用家庭层面数据以减少观测误差。

Abstract

Abstract This paper analyzes how closely different income measures conform to Benford's law, a mathematical predictor of probable first digit distribution across many sets of numbers. Because Benford's law can be used to test data set reliability, we use a Benford analysis to assess the quality of six widely used survey data sets. Our findings indicate that although income generally obeys Benford's law, almost all the data sets show substantial discrepancies from it, which we interpret as a strong indicator of reliability issues in the survey data. This result is confirmed by a simulation, which demonstrates that household level income data do not manifest the same poor performance as individual level data. This finding implies that researchers should focus on household level characteristics whenever possible to reduce observation errors.

本福特定律调查数据可靠性收入数据数据质量