利用弱对齐源的数据融合

Data Fusion Using Weakly Aligned Sources

Journal of the American Statistical Association · 2025

被引 3 · 同刊同年前 8%

ABS 4

Sijia Li 通讯
Peter B. Gilbert
Rui Duan
Alex Luedtke

中文导读

提出一种新数据融合方法，利用多个数据源估计光滑有限维参数，允许纳入弱对齐源并量化效率增益，适用于HIV抗体预防试验等场景。

Abstract

We introduce a new data fusion method that utilizes multiple data sources to estimate a smooth, finite-dimensional parameter. Most existing methods only make use of fully aligned data sources that share common conditional distributions of one or more variables of interest. However, in many settings, the scarcity of fully aligned sources can make existing methods require unduly large sample sizes to be useful. Our approach enables the incorporation of weakly aligned data sources that are not perfectly aligned, provided their degree of misalignment is known up to finite-dimensional parameters. We quantify the additional efficiency gains achieved through the integration of these weakly aligned sources. We characterize the semiparametric efficiency bound and provide a general means to construct estimators achieving these efficiency gains. We illustrate our results by fusing data from two harmonized HIV monoclonal antibody prevention efficacy trials to study how a neutralizing antibody biomarker associates with HIV genotype.

数据融合计量经济学生物统计临床试验

免费全文 ↗阅读原文 ↗