🌙

多源无监督域适应的分布鲁棒学习

Distributionally robust learning for multisource unsupervised domain adaptation

Annals of Statistics · 2026
被引 0 · 同刊同年前 7%
ABS 4*

中文导读

提出一种分布鲁棒模型,利用多个有标签源域和无标签目标域数据,通过加权平均源域条件结果模型并加入偏差校正,提升目标域泛化能力,适用于随机森林、提升法、神经网络等算法。

Abstract

Empirical risk minimization often performs poorly when the distribution of the target domain differs from those of the source domains. To address such potential distributional shifts, we develop an unsupervised domain adaptation approach that leverages labeled data from multiple source domains and unlabeled data from the target domain. We introduce a distributionally robust model that optimizes an adversarial reward based on explained variance across a class of target distributions, ensuring generalization to the target domain. We show that the proposed robust model is a weighted average of conditional outcome models from the source domains. This formulation allows us to compute the robust model through the aggregation of source models, which can be estimated using various machine learning algorithms of the user’s choice such as random forests, boosting and neural networks. Additionally, we introduce a bias-correction step to obtain a more accurate aggregation weight, which is effective for various machine learning algorithms. Our framework can be interpreted as a distributionally robust federated learning approach that satisfies privacy constraints while providing insights into the importance of each source for prediction on the target domain. The performance of our method is evaluated on both simulated and real data.

机器学习域适应分布鲁棒优化无监督学习