BENFORD'S LAW AS AN INDICATOR OF SURVEY RELIABILITY—CAN WE TRUST OUR DATA?
用本福特定律检验六个常用调查数据集的收入变量,发现多数数据与定律偏差较大,表明存在可靠性问题,并建议研究者优先使用家庭层面数据以减少观测误差。
Abstract This paper analyzes how closely different income measures conform to Benford's law, a mathematical predictor of probable first digit distribution across many sets of numbers. Because Benford's law can be used to test data set reliability, we use a Benford analysis to assess the quality of six widely used survey data sets. Our findings indicate that although income generally obeys Benford's law, almost all the data sets show substantial discrepancies from it, which we interpret as a strong indicator of reliability issues in the survey data. This result is confirmed by a simulation, which demonstrates that household level income data do not manifest the same poor performance as individual level data. This finding implies that researchers should focus on household level characteristics whenever possible to reduce observation errors.