
A Distribution-Free Method for Change Point Detection in Non-Sparse High Dimensional Data

Journal of Computational and Graphical Statistics · 2024
Cited by 1
ABS 3

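Summary

Proposes a distribution-free, distance-based method for change point detection in high dimensional data that can handle cases where the sample size is far smaller than the dimension or where the changes are non-sparse, and provides the R package HDDchangepoint.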

Abstract

We propose a distribution-free, distance-based method for high dimensional change points that can address challenging situations in which the sample size is very small compared to the dimension, as in so-called high dimension, low sample size (HDLSS) data, or in which non-sparse changes occur because many variables change with small but significant magnitudes. Our method can detect changes in the mean or variance of high dimensional observations as well as other distributional changes. We present efficient algorithms that can detect single and multiple high dimensional change points. We use nonparametric metrics, including a new dissimilarity measure and some new distance and difference distance matrices, to develop a procedure to estimate change point locations. We also introduce a nonparametric test to determine the significance of estimated change points. We provide theoretical guarantees for our method and demonstrate its empirical performance in comparison with some recent methods for high dimensional change points. An R package called HDDchangepoint is developed to implement the proposed method.
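
To make the general idea of a distance-based change point search concrete, the following is a minimal sketch in Python. It does not reproduce the paper's dissimilarity measure, its distance and difference distance matrices, or the HDDchangepoint API. As a stand-in it uses a generic energy-distance-style contrast on Euclidean distances, with a permutation test for significance. All function names, the `min_seg` parameter, and the simulated data (24 observations in 200 dimensions, with every coordinate shifted by 1 after the 12th observation) are assumptions made for illustration.

```python
import math
import random


def pairwise_distances(X):
    """Euclidean distance matrix for a list of equal-length vectors."""
    n = len(X)
    D = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            D[i][j] = D[j][i] = math.dist(X[i], X[j])
    return D


def split_statistic(D, order, t):
    """Energy-distance-style contrast between order[:t] and order[t:].

    Within-group means include the zero diagonal (V-statistic form).
    This is an illustrative stand-in, not the paper's measure.
    """
    left, right = order[:t], order[t:]
    n1, n2 = len(left), len(right)
    between = sum(D[i][j] for i in left for j in right) / (n1 * n2)
    within_l = sum(D[i][j] for i in left for j in left) / (n1 * n1)
    within_r = sum(D[i][j] for i in right for j in right) / (n2 * n2)
    return (n1 * n2 / (n1 + n2)) * (2 * between - within_l - within_r)


def estimate_change_point(D, order=None, min_seg=2):
    """Scan all admissible splits and return (best split, its statistic)."""
    n = len(D)
    order = list(range(n)) if order is None else order
    best_t, best_stat = None, -math.inf
    for t in range(min_seg, n - min_seg + 1):
        stat = split_statistic(D, order, t)
        if stat > best_stat:
            best_t, best_stat = t, stat
    return best_t, best_stat


def permutation_p_value(D, observed, n_perm=99, seed=0, min_seg=2):
    """Permutation p-value: shuffle the time order and redo the full scan."""
    rng = random.Random(seed)
    n = len(D)
    exceed = 0
    for _ in range(n_perm):
        perm = list(range(n))
        rng.shuffle(perm)
        _, stat = estimate_change_point(D, perm, min_seg)
        if stat >= observed:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)


# Simulated HDLSS example (assumed data): n = 24 observations, p = 200
# dimensions, and a dense mean shift in every coordinate after index 11.
rng = random.Random(1)
n, p, true_cp = 24, 200, 12
X = [[rng.gauss(0.0, 1.0) + (1.0 if i >= true_cp else 0.0) for _ in range(p)]
     for i in range(n)]

D = pairwise_distances(X)
t_hat, stat = estimate_change_point(D)
p_value = permutation_p_value(D, stat)
print("estimated change point:", t_hat, "permutation p-value:", p_value)
```

Because the search only touches the n-by-n distance matrix, its cost depends on the dimension only through the initial distance computation, which is one reason distance-based approaches suit the case where the sample size is much smaller than the dimension. Multiple change points could be handled by applying the same scan recursively to each resulting segment, though the paper's own algorithms may differ.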

Computer Science · Data Mining · Pattern Recognition · Artificial Intelligence