
Inferring bivariate associations with continuous data from studies using respondent-driven sampling

Journal of the Royal Statistical Society. Series C: Applied Statistics · 2024
Cited by: 0
ABS 3

Summary

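For bivariate associations involving continuous variables collected through respondent-driven sampling (RDS), the paper proposes a semiparametric randomization test that controls the false positives induced by homophily, and applies it to a tuberculosis study among people who smoke illicit drugs in Worcester, South Africa.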

Abstract

Respondent-driven sampling (RDS) is a link-tracing sampling design that was developed to sample from hidden populations. Although associations between variables are of great interest in epidemiological research, there has been little statistical work on inference about relationships between variables collected through RDS. The link-tracing design, combined with homophily (the tendency for people to connect to others with whom they share characteristics), induces similarity between linked individuals. This dependence inflates the Type 1 error of conventional statistical methods (e.g. t-tests and regression). A semiparametric randomization test was previously developed to test for association between two categorical variables. We directly extend this work and propose a semiparametric randomization test for relationships between two variables when one or both are continuous. We apply our method to variables that are important for understanding tuberculosis epidemiology among people who smoke illicit drugs in Worcester, South Africa.
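
For orientation, the simplest form of a randomization test for two continuous variables can be sketched as below. This is only the generic permutation test of a Pearson correlation, not the semiparametric test proposed in the paper: it treats all observations as exchangeable and ignores recruitment links, which is precisely the dependence the abstract says inflates Type 1 error under RDS. The function names `pearson_r` and `permutation_p_value` are hypothetical and chosen for illustration.

```python
import random
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def permutation_p_value(x, y, n_perm=999, seed=0):
    """Two-sided permutation p-value for H0: no association between x and y.

    Assumption: observations are exchangeable under H0. RDS data generally
    violate this, so this is a naive baseline, not the paper's method.
    """
    rng = random.Random(seed)
    observed = abs(pearson_r(x, y))
    y_perm = list(y)
    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)  # break any pairing between x and y
        # Small tolerance guards against floating-point ties.
        if abs(pearson_r(x, y_perm)) >= observed - 1e-12:
            extreme += 1
    # Add one to numerator and denominator so the p-value is never zero.
    return (extreme + 1) / (n_perm + 1)
```

A test that respects the RDS design would need to restrict or reweight the shuffles so that the similarity between linked individuals is preserved under the null; how the paper does this is not specified in the abstract.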

Epidemiology · Statistics · Sampling methods · Social sciences · Public health