Everything under control: comparing machine learning and classical econometric impact assessment methods using FADN data
利用农场会计数据网络(FADN)的模拟数据,比较了机器学习与经典计量方法在不同因果属性下的政策评估表现,发现因果森林在识别因果效应和异质性上最优,但机器学习在控制变量选择和潜在混杂处理上效果不佳。
Abstract Machine learning (ML) methods have been proposed to improve the assessment of agricultural policies through enhanced causal inference. This study uses a simulation framework tailored to Farm Accountancy Data Network (FADN) data to scrutinize the performance of both ML and classical methods under diverse causal properties crucial for identification. Our findings reveal significant variations in performance across different treatment assignment rules, sample sizes and causal properties. Notably, the Causal Forest method consistently outperforms others in retrieving the causal effect and accurately characterizing its heterogeneity. However, the data-driven approach of ML methods proves ineffective in selecting the correct set of controls and addressing latent confounding.