Large-scale multiple testing of composite null hypotheses under heteroskedasticity
提出一种异方差调整的多重检验方法,避免标准化数据,直接利用方差信息,通过非参数经验贝叶斯解卷积估计量控制假发现率,在模拟和移动游戏用户检测应用中表现更优。
Summary Heteroskedasticity poses several methodological challenges in designing valid and powerful procedures for simultaneous testing of composite null hypotheses. In particular, the conventional practice of standardizing or rescaling heteroskedastic test statistics in this setting may severely affect the power of the underlying multiple testing procedure. Additionally, when the inferential parameter of interest is correlated with the variance of the test statistic, methods that ignore this dependence may fail to control the Type I error at the desired level. We propose a new heteroskedasticity-adjusted multiple testing procedure that avoids data reduction by standardization and directly incorporates the side information from the variances into the testing procedure. Our approach relies on an improved nonparametric empirical Bayes deconvolution estimator that offers a practical way of capturing the dependence between the inferential parameter of interest and the variance of the test statistic. We develop theory to establish that the proposed procedure is asymptotically valid and optimal for false discovery rate control. Simulation results demonstrate that our method outperforms existing procedures, with substantial power gains across many settings at the same false discovery rate level. The method is illustrated with an application involving the detection of engaged users on a mobile game app.