🌙

多标签随机子空间集成分类

Multi-label Random Subspace Ensemble Classification

Journal of Computational and Graphical Statistics · 2024
被引 0
ABS 3

中文导读

提出多标签随机子空间集成(mRaSE)框架,通过随机采样子空间并选择最优弱学习器来提升多标签分类性能,同时提供无模型特征排序,在模拟和真实数据上优于随机森林和深度神经网络。

Abstract

In this work, we develop a new ensemble learning framework, multi-label Random Subspace Ensemble (mRaSE), for multi-label classification. Given a base classifier (e.g., multinomial logistic regression, classification tree, K-nearest neighbors), mRaSE works by first randomly sampling a collection of subspaces, then choosing the best ones that achieve the minimum cross-validation errors and, finally, aggregating the chosen weak learners. In addition to its superior prediction performance, mRaSE also provides a model-free feature ranking depending on the given base classifier. An iterative version of mRaSE is also developed to further improve the performance. A model-free extension is pursued on the iterative version, leading to the so-called Super mRaSE, which accepts a collection of base classifiers as input to the algorithm. We show the proposed algorithms compared favorably with the state-of-the-art classification algorithm including random forest and deep neural network, via extensive simulation studies and two real data applications. The new algorithms are implemented in an updated version of the R package RaSEn.

多标签分类集成学习随机子空间机器学习分类算法