当零可能不是零：使用评分者间信度评估基金同行评审的注意事项

When Zero May Not Be Zero: A Cautionary Note on the Use of Inter-Rater Reliability in Evaluating Grant Peer Review

Journal of the Royal Statistical Society. Series A: Statistics in Society · 2021

被引 30 · 同刊同年前 5%

ABS 3

Elena A. Erosheva 通讯
Patrícia Martinková
Carole J. Lee

中文导读

研究发现，仅用高质量提案子集计算评分者间信度容易得到零值，但这不代表评审随意；完整数据下信度高于0.6，且评审人数少时也可能出现零估计。

Abstract

Abstract Considerable attention has focused on studying reviewer agreement via inter-rater reliability (IRR) as a way to assess the quality of the peer review process. Inspired by a recent study that reported an IRR of zero in the mock peer review of top-quality grant proposals, we use real data from a complete range of submissions to the National Institutes of Health and to the American Institute of Biological Sciences to bring awareness to two important issues with using IRR for assessing peer review quality. First, we demonstrate that estimating local IRR from subsets of restricted-quality proposals will likely result in zero estimates under many scenarios. In both data sets, we find that zero local IRR estimates are more likely when subsets of top-quality proposals rather than bottom-quality proposals are considered. However, zero estimates from range-restricted data should not be interpreted as indicating arbitrariness in peer review. On the contrary, despite different scoring scales used by the two agencies, when complete ranges of proposals are considered, IRR estimates are above 0.6 which indicates good reviewer agreement. Furthermore, we demonstrate that, with a small number of reviewers per proposal, zero estimates of IRR are possible even when the true value is not zero.

同行评审基金评审评分者间信度科研评估

阅读原文 ↗