🌙

利用调查数据和机器学习技术丰富行政数据

Enriching administrative data using survey data and machine learning techniques

Economics Letters · 2024
被引 1
人大 BABS 3

中文导读

提出一种用机器学习将调查数据中的信息迁移到行政数据的方法,并以德国最低工资研究为例验证其有效性,帮助研究者利用大规模行政数据回答更细致的问题。

Abstract

I propose an approach to enrich administrative data with information only available in survey data using machine learning techniques. To illustrate the approach, I replicate a prominent study that used survey data to analyze the federal minimum wage introduction in Germany. In contrast to the original study, I use the universe of German establishments rather than the limited number of establishments that participated in the survey. As the administrative data do not contain information on whether establishments were treated by the minimum wage, I use a random forest classifier, trained on survey data, to predict the treatment status of establishments. The results obtained using the administrative data are qualitatively similar to the results obtained using the survey data. Beyond replication of previous research, this approach broadens the research potential of administrative data, enabling researchers to explore more detailed research questions at scale. • I propose an approach to enrich administrative data with additional variables. • I use machine learning and survey data to enrich the administrative data. • The approach enables researchers to answer more detailed questions at scale. • To illustrate the approach, I replicate a study on the effects of minimum wages.

经济学机器学习数据科学劳动经济学