Proposer of the vote of thanks and contribution to the Discussion of ‘the Discussion Meeting on Probabilistic and statistical aspects of machine learning’
本文提出一种基于神经网络自动生成离线变化点检测方法,理论量化误差率与训练数据量的关系,实验表明在噪声相关或重尾时优于传统CUSUM方法。
Detecting change points in data is challenging because of the range of possible types of change and types of behaviour of data when there is no change.Statistically efficient methods for detecting a change will depend on both of these features, and it can be difficult for a practitioner to develop an appropriate detection method for their application of interest.We show how to automatically generate new offline detection methods based on training a neural network.Our approach is motivated by many existing tests for the presence of a change point being representable by a simple neural network, and thus a neural network trained with sufficient data should have performance at least as good as these methods.We present theory that quantifies the error rate for such an approach, and how it depends on the amount of training data.Empirical results show that, even with limited training data, its performance is competitive with the standard cumulative sum (CUSUM) based classifier for detecting a change in mean when the noise is independent and Gaussian, and can substantially outperform it in the presence of auto-correlated or heavy-tailed noise.Our method also shows strong results in detecting and localizing changes in activity based on accelerometer data.