THE MEANING OF FAILED REPLICATIONS: A REVIEW AND PROPOSAL
提出一个保守标准来判定一项研究是否构成对另一项研究的复制,强调复制研究应证明其本应得到与原始研究相同的结果,并指出许多被称为复制检验的研究其实不应如此归类。
Abstract The welcome rise of replication tests in economics has not been accompanied by a consensus standard for determining what constitutes a replication . A discrepant replication, in current usage of the term, can signal anything from an unremarkable disagreement over methods to scientific incompetence or misconduct. This paper proposes a standard for classifying one study as a replication of some other study. It is a standard that places the burden of proof on a study to demonstrate that it should have obtained identical results to the original, a conservative standard that is already used implicitly by many researchers. It contrasts this standard with decades of unsuccessful attempts to harmonize terminology, and argues that many prominent results described as replication tests should not be described as such. Adopting a conservative standard like this one can improve incentives for researchers, encouraging more and better replication tests.