异方差下复合零假设的大规模多重检验

Large-scale multiple testing of composite null hypotheses under heteroskedasticity

Biometrika · 2025
被引 2
ABS 4

中文导读

提出一种异方差调整的多重检验方法,避免标准化数据,直接利用方差信息,通过非参数经验贝叶斯解卷积估计量控制假发现率,在模拟和移动游戏用户检测应用中表现更优。

Abstract

Summary Heteroskedasticity poses several methodological challenges in designing valid and powerful procedures for simultaneous testing of composite null hypotheses. In particular, the conventional practice of standardizing or rescaling heteroskedastic test statistics in this setting may severely affect the power of the underlying multiple testing procedure. Additionally, when the inferential parameter of interest is correlated with the variance of the test statistic, methods that ignore this dependence may fail to control the Type I error at the desired level. We propose a new heteroskedasticity-adjusted multiple testing procedure that avoids data reduction by standardization and directly incorporates the side information from the variances into the testing procedure. Our approach relies on an improved nonparametric empirical Bayes deconvolution estimator that offers a practical way of capturing the dependence between the inferential parameter of interest and the variance of the test statistic. We develop theory to establish that the proposed procedure is asymptotically valid and optimal for false discovery rate control. Simulation results demonstrate that our method outperforms existing procedures, with substantial power gains across many settings at the same false discovery rate level. The method is illustrated with an application involving the detection of engaged users on a mobile game app.

计量经济学统计学多重检验异方差假发现率控制