Identification of Expected Outcomes in a Data Error Mixing Model With Multiplicative Mean Independence
研究了在数据污染抽样中识别均值结果的问题,将统计独立性假设放松为均值独立性,并推广到比例因子已知或有界的情况,以处理非随机报告误差,如非法药物使用调查中的误差。
We consider the problem of identifying a mean outcome in corrupt sampling where the observed outcome is a mixture of the distribution of interest and some other distribution. We make two contributions to this literature. First, the statistical independence assumption maintained under contaminated sampling is relaxed to the weaker assumption that the outcome is mean independent of the mixing process. We then generalize this restriction to allow the two conditional means to differ by a known or bounded factor of proportionality. Second, in the special case of a binary outcome, we consider the possibility that draws from the alternative distribution are known to be erroneous, as might be the case in a mixture model of response error. We illustrate how these assumptions can be used to inform researchers about the population's use of illicit drugs in the presence of nonrandom reporting errors. In this application, we find that a response error model with multiplicative mean independence is easy to motivate and can have substantial identifying power.