🌙

基于距离约束的半监督集成分类器用于高维数据

Semi-Supervised Ensemble Classifier Based on Distance Constraint for High-Dimensional Data

IEEE Transactions on Systems, Man, and Cybernetics: Systems · 2025
被引 2
ABS 3

中文导读

针对高维噪声数据中标记样本少的问题,提出一种基于距离约束正则化的半监督宽度学习系统(DRBLS)及其集成方法(E-DRBLS),通过混合降维空间生成提升分类性能。

Abstract

Due to its exceptional feature representation capabilities and high computational efficiency, the broad learning system (BLS) has been widely employed in various classification tasks. Nevertheless, BLS encounters considerable challenges in semi-supervised classification tasks involving complex heterogeneous data, given the data’s high-dimensional and noisy nature, coupled with a limited number of available labeled samples. To tackle these challenges, this article introduces a semi-supervised BLS based on distance constraint regularization (DRBLS) and a semi-supervised broad ensemble method (E-DRBLS) for high-dimensional data. Specifically, we present a distance constraint regularization (DR) that utilizes both labeled and unlabeled data to derive an optimal projection matrix, which maximizes the preservation of the original data’s intrinsic distribution structure. DR is designed to minimize intraclass distance, maximize interclass distance, and minimize the distance between neighboring samples. To boost the performance of BLS in semi-supervised classification, we integrate DR and BLS to construct the semi-supervised classifier DRBLS. Finally, we propose a mixed dimensionality reduction space generation (MDRSG) method that generates multiple high-quality and diverse mixed dimensionality reduction spaces (MDRSs). Based on MDRS, an ensemble framework, E-DRBLS, is developed for semi-supervised classification tasks targeting high-dimensional data. Comprehensive experiments confirm the superiority of the proposed methods.

机器学习分类算法高维数据处理半监督学习