Robust sparse Bayesian regression for longitudinal gene–environment interactions

Journal of the Royal Statistical Society. Series C: Applied Statistics · 2025
Cited by 3 · Top 6% among papers in the same journal and year
ABS 3

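Summary

For interactions between high-dimensional genetic factors and the environment in longitudinal studies, the paper proposes a robust sparse Bayesian mixed model that handles outliers and the correlation among repeated measurements, performs variable selection through structured spike-and-slab priors, and outperforms alternatives in simulations and on lipidomics data.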

Abstract

In longitudinal studies, repeated measures analysis of variance (ANOVA) is a classical analysis in which selecting important main and interaction effects for accurate estimation and prediction is one of the central goals. With high-dimensional genetic factors, ANOVA leads to a sparse longitudinal gene–environment (G×E) interaction problem that has not been thoroughly investigated so far, partly because of the challenges of incorporating robustness against skewed phenotypic measurements, intra-cluster correlations among longitudinal observations, and structured sparsity arising from the ANOVA design. We have developed a novel robust sparse Bayesian mixed model to tackle these challenges. Outliers and inter-relatedness among repeated measurements can be efficiently accommodated. Meanwhile, the proposed model conducts robust Bayesian variable selection accounting for main and interaction effects via structured spike-and-slab priors. We have developed Gibbs samplers and MCMC algorithms for fast computation and posterior inference. The advantage of the proposed method over benchmarks in variable selection and estimation has been established through extensive simulations. In the case study, we have analysed longitudinal lipidomics data with repeatedly measured body weight of CD-1 mice from a cancer prevention study. The proposed model has identified main effects and interactions with important implications and led to better prediction performance than alternative methods.
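
For readers unfamiliar with the term, a spike-and-slab prior in its generic textbook form can be written as below. This is an illustrative sketch only, not the specification used in the paper: the symbols $\beta_j$ (a regression coefficient), $\gamma_j$ (its inclusion indicator), $\tau_j^2$ (the slab variance) and $\pi$ (the prior inclusion probability) are assumptions introduced here, and the abstract does not state how the authors structure the prior across main and interaction effects.

```latex
% Generic spike-and-slab prior (illustrative; not taken from the paper).
% \delta_0 denotes a point mass at zero (the "spike");
% the normal component is the "slab".
\begin{aligned}
\beta_j \mid \gamma_j, \tau_j^2 &\sim \gamma_j \, \mathcal{N}(0, \tau_j^2) + (1 - \gamma_j)\, \delta_0(\beta_j), \\
\gamma_j \mid \pi &\sim \mathrm{Bernoulli}(\pi).
\end{aligned}
```

Under a prior of this kind, the posterior mean of $\gamma_j$ is commonly read as the posterior inclusion probability of the corresponding effect, which is what makes the prior usable for variable selection.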

Bayesian statistics · Longitudinal data analysis · Gene–environment interaction · Variable selection · Robust regression