🌙

多路稀疏距离加权判别

Multiway Sparse Distance Weighted Discrimination

Journal of Computational and Graphical Statistics · 2022
被引 2
ABS 3

中文导读

提出一种适用于任意维度和稀疏程度的多路分类框架,扩展了距离加权判别方法,在模拟和真实数据(如磁共振波谱和基因表达数据)中提升了分类准确率,并提供了R包实现。

Abstract

Modern data often take the form of a multiway array. However, most classification methods are designed for vectors, i.e., 1-way arrays. Distance weighted discrimination (DWD) is a popular high-dimensional classification method that has been extended to the multiway context, with dramatic improvements in performance when data have multiway structure. However, the previous implementation of multiway DWD was restricted to classification of matrices, and did not account for sparsity. In this paper, we develop a general framework for multiway classification which is applicable to any number of dimensions and any degree of sparsity. We conducted extensive simulation studies, showing that our model is robust to the degree of sparsity and improves classification accuracy when the data have multiway structure. For our motivating application, magnetic resonance spectroscopy (MRS) was used to measure the abundance of several metabolites across multiple neurological regions and across multiple time points in a mouse model of Friedreich's ataxia, yielding a four-way data array. Our method reveals a robust and interpretable multi-region metabolomic signal that discriminates the groups of interest. We also successfully apply our method to gene expression time course data for multiple sclerosis treatment. An R implementation is available in the package MultiwayClassification at http://github.com/lockEF/MultiwayClassification.

分类方法高维数据多路数组稀疏性生物医学应用