回归:短回归与长回归

Regressions, Short and Long

Econometrica · 2002
被引 0
人大 A+FT50ABS 4*

中文导读

研究当已知短回归条件分布但未知长回归条件分布时,如何识别长回归,并给出识别区域和边界,适用于利用两个独立数据集进行推断的研究者。

Abstract

We study the problem of identi cation of the long regression E(y j x � z) when the short conditional distributions P (y j x) and P (z j x) are known but the long conditional distribution P (y j x � z) is not known. This problem often arises when a researcher utilizes data from two separate data sets. (A leading example is the ecological inference problem of political science, where voting behavior across electoral districts is observed from administrative records, the demographic composition of voters within a district is observed from census data, and the researcher wants to infer voting behavior conditional on district and demographic attributes.) We isolate an identi cation region containing feasible values of the long regression, and show that this region forms a sharp bound on the long regression. The identi cation region can be calculated precisely when y has nite support. When y has in nite support we characterize two sets, one that contains the identi cation region, and one that is contained by it. Following this completely nonparametric analysis, we examine the identifying power yielded by exclusion restrictions across distinct covariate values. Such restrictions cause the identi cation region to shrink, in many cases to a single point. To illustrate the theory, we pose and address this hypothetical question: What would be the outcome if the 1996 U.S. presidential election were re-enacted in a population of di erent demographic composition, ceteris paribus? We have bene tted from the opportunity to present this research in seminars at Northwestern

长回归识别短条件分布生态推断识别区域