Genetic Programming for Document Classification: A Transductive Transfer Learning System

IEEE Transactions on Cybernetics · 2023
Cited by 4
ABS 3

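Summary

The paper proposes a transductive transfer learning system that uses solutions evolved by genetic programming on a source domain to automatically generate pseudolabels for unlabeled target-domain training data, and then trains classifiers on that data. On nine tasks it predicts more accurately than existing methods.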

Abstract

Document classification is a challenging task because the data are high-dimensional and sparse. Many transfer learning methods have been investigated to improve classification performance by transferring knowledge from a source domain to a target domain that is similar to, but different from, the source domain. However, most existing methods cannot handle the case where the target-domain training data have no labels. In this study, we propose a transductive transfer learning system that uses solutions evolved by genetic programming (GP) on a source domain to automatically pseudolabel the training data in the target domain in order to train classifiers. Unlike many other transfer learning techniques, the proposed system pseudolabels target-domain training data to retrain classifiers using all target-domain features. The proposed method is examined on nine transfer learning tasks, and the results show that the proposed transductive GP system has better prediction accuracy on the target-domain test data than existing transfer learning approaches, including subspace alignment-domain adaptation methods, feature-level-domain adaptation methods, and one recent pseudolabeling strategy-based method.
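
The pseudolabel-then-retrain idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' system: a simple nearest-centroid classifier stands in for the GP-evolved solution, and every function name and all of the toy documents below are hypothetical, chosen only to show the flow (train on the labeled source domain, pseudolabel the unlabeled target training data, retrain on all target-domain features).

```python
import math
from collections import Counter, defaultdict


def bag_of_words(text):
    """Turn a document into a word-count vector."""
    return Counter(text.lower().split())


def train_centroids(docs, labels):
    """Sum word counts per class (a stand-in for the GP-evolved model)."""
    centroids = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        centroids[label].update(bag_of_words(doc))
    return dict(centroids)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def predict(centroids, doc):
    """Return the class whose centroid is most similar to the document."""
    bow = bag_of_words(doc)
    return max(sorted(centroids), key=lambda label: cosine(bow, centroids[label]))


# Labeled source domain (toy data, invented for illustration).
source_docs = [
    "team wins match goal",
    "player scores goal match",
    "software update chip release",
    "new chip software launch",
]
source_labels = ["sports", "sports", "tech", "tech"]

# Unlabeled target-domain training data: shares some words with the source
# domain but also contains target-specific vocabulary.
target_train_docs = [
    "goal striker penalty referee",
    "match referee striker offside",
    "chip gpu silicon benchmark",
    "software gpu benchmark driver",
]

# Step 1: fit the source model on labeled source data.
source_model = train_centroids(source_docs, source_labels)

# Step 2: pseudolabel the unlabeled target training data with the source model.
pseudo_labels = [predict(source_model, doc) for doc in target_train_docs]

# Step 3: retrain on the target data using ALL target-domain features,
# including words the source model never saw.
target_model = train_centroids(target_train_docs, pseudo_labels)

# A target test document made only of target-specific words.
test_doc = "gpu driver silicon"
print(pseudo_labels)
print(predict(target_model, test_doc))
```

In this toy setup the retrained model can classify a test document that shares no vocabulary with the source domain, which is the motivation for retraining on all target-domain features rather than only the features shared between domains.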

Keywords: document classification, transfer learning, genetic programming, pseudolabeling, machine learning