数据误差混合模型中具有乘法均值独立性的预期结果识别

Identification of Expected Outcomes in a Data Error Mixing Model With Multiplicative Mean Independence

Journal of Business & Economic Statistics · 2010

被引 11

人大 AABS 4

Brent Kreider · 爱荷华州立大学
John V. Pepper · 弗吉尼亚大学

中文导读

研究了在数据污染抽样中识别均值结果的问题，将统计独立性假设放松为均值独立性，并推广到比例因子已知或有界的情况，以处理非随机报告误差，如非法药物使用调查中的误差。

Abstract

We consider the problem of identifying a mean outcome in corrupt sampling where the observed outcome is a mixture of the distribution of interest and some other distribution. We make two contributions to this literature. First, the statistical independence assumption maintained under contaminated sampling is relaxed to the weaker assumption that the outcome is mean independent of the mixing process. We then generalize this restriction to allow the two conditional means to differ by a known or bounded factor of proportionality. Second, in the special case of a binary outcome, we consider the possibility that draws from the alternative distribution are known to be erroneous, as might be the case in a mixture model of response error. We illustrate how these assumptions can be used to inform researchers about the population's use of illicit drugs in the presence of nonrandom reporting errors. In this application, we find that a response error model with multiplicative mean independence is easy to motivate and can have substantial identifying power.

数据误差混合模型乘法均值独立性期望结果识别二元结果

阅读原文 ↗