Critical Values Robust to P-hacking

Review of Economics and Statistics · 2024

Cited by 4

Journal rankings: Renmin University of China (人大) AFT50 · ABS 4

Adam McCloskey · University of Colorado Boulder
Pascal Michaillat · University of California, Santa Cruz

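Summary

P-hacking is widespread in practice, so the authors build a hypothesis-testing model that incorporates it. From that model they derive robust critical values that keep spurious significant results from occurring more often than intended. These robust values are larger than classical critical values. In the model calibrated to medical science, the robust critical value equals the classical critical value for the same test statistic, computed at one fifth of the significance level.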

Abstract

P-hacking is prevalent in reality but absent from classical hypothesis-testing theory. We therefore build a model of hypothesis testing that accounts for p-hacking. From the model, we derive critical values such that, if they are used to determine significance, and if p-hacking adjusts to the new significance standards, spurious significant results do not occur more often than intended. Because of p-hacking, such robust critical values are larger than classical critical values. In the model calibrated to medical science, the robust critical value is the classical critical value for the same test statistic but with one fifth of the significance level.
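To make the last sentence of the abstract concrete, here is a minimal sketch of the arithmetic. It assumes a two-sided z-test and a nominal significance level of 0.05. Both are illustrative choices, not taken from the paper. The one-fifth factor is the figure the abstract reports for the medical-science calibration. The sketch does not reproduce the paper's model or how that factor is derived, and the function name `two_sided_z_critical` is hypothetical.

```python
from statistics import NormalDist


def two_sided_z_critical(alpha: float) -> float:
    """Return c such that P(|Z| > c) = alpha for a standard normal Z."""
    return NormalDist().inv_cdf(1 - alpha / 2)


# Assumed nominal significance level (illustrative, not from the paper).
alpha = 0.05

# Classical critical value at the nominal level.
classical = two_sided_z_critical(alpha)

# Robust critical value per the abstract's medical-science calibration:
# the same test's classical critical value at one fifth of the level.
robust = two_sided_z_critical(alpha / 5)

print(f"classical critical value: {classical:.3f}")  # 1.960
print(f"robust critical value:    {robust:.3f}")     # 2.576
```

Under these assumptions, a z-statistic would need to exceed roughly 2.58 rather than 1.96 in absolute value to count as significant.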

P-hacking · Hypothesis testing · Critical values · Significance level
