🌙

成分数据中的稳健差异丰度检验

Robust differential abundance test in compositional data

Biometrika · 2022
被引 13
ABS 4

中文导读

提出一种稳健差异丰度检验方法,解决成分数据中零值多和组成约束导致的统计难题,能控制假阳性并处理协变量混杂效应,适用于单细胞、RNA-seq和微生物组数据分析。

Abstract

Summary Differential abundance tests for compositional data are essential and fundamental in various biomedical applications, such as single-cell, bulk RNA-seq and microbiome data analysis. However, because of the compositional constraint and the prevalence of zero counts in the data, differential abundance analysis on compositional data remains a complicated and unsolved statistical problem. This article proposes a new differential abundance test, the robust differential abundance test, to address these challenges. Compared with existing methods, the robust differential abundance test is simple and computationally efficient, is robust to prevalent zero counts in compositional datasets, can take the data’s compositional nature into account, and has a theoretical guarantee of controlling false discoveries in a general setting. Furthermore, in the presence of observed covariates, the robust differential abundance test can work with covariate-balancing techniques to remove potential confounding effects and draw reliable conclusions. The proposed test is applied to several numerical examples, and its merits are demonstrated using both simulated and real datasets.

生物医学单细胞测序微生物组统计方法