🌙

面向生物银行规模关联研究的定量响应变量的强大变换方法

A Powerful Transformation of Quantitative Responses for Biobank-Scale Association Studies

Journal of the American Statistical Association · 2025
被引 0
ABS 4

中文导读

提出一种利用误差密度信息的响应变量变换方法,在生物银行规模数据中能高效检测微弱遗传信号,并通过UK Biobank数据验证其控制错误率和提升统计功效的能力。

Abstract

In linear regression models with non-Gaussian errors, transformations of the response variable are widely used in a broad range of applications. Motivated by various genetic association studies, transformation methods for hypothesis testing have received substantial interest. In recent years, the rise of biobank-scale genetic studies, which feature a vast number of participants that could be around half a million, spurred the need for new transformation methods that are both powerful for detecting weak genetic signals and computationally efficient for large-scale data. In this work, we propose a novel transformation method that leverages the information of the error density. This transformation leads to locally most powerful tests and therefore has strong power for detecting weak signals. To make the computation scalable to biobank-scale studies, we harnessed the nature of weak genetic signals and proposed a consistent and computationally efficient estimator of the transformation function. Through extensive simulations and a gene-based analysis of spirometry traits from the UK Biobank, we validate that our approach maintains stringent control over type I error rates and significantly enhances statistical power over existing methods.

遗传关联研究统计方法生物银行假设检验