🌙

度量空间中基于度量分布函数的非参数统计推断

Nonparametric Statistical Inference via Metric Distribution Function in Metric Spaces

Journal of the American Statistical Association · 2023
被引 12
ABS 4

中文导读

针对欧氏空间分布函数无法处理复杂数据对象的问题,本文在度量空间中仅通过度量定义了一类度量分布函数,证明了对应定理和Glivenko-Cantelli定理,并开发了同质性检验和相互独立性检验方法。

Abstract

The distribution function is essential in statistical inference and connected with samples to form a directed closed loop by the correspondence theorem in measure theory and the Glivenko-Cantelli and Donsker properties. This connection creates a paradigm for statistical inference. However, existing distribution functions are defined in Euclidean spaces and are no longer convenient to use in rapidly evolving data objects of complex nature. It is imperative to develop the concept of the distribution function in a more general space to meet emerging needs. Note that the linearity allows us to use hypercubes to define the distribution function in a Euclidean space. Still, without the linearity in a metric space, we must work with the metric to investigate the probability measure. We introduce a class of metric distribution functions through the metric only. We overcome this challenging step by proving the correspondence theorem and the Glivenko-Cantelli theorem for metric distribution functions in metric spaces, laying the foundation for conducting rational statistical inference for metric space-valued data. Then, we develop a homogeneity test and a mutual independence test for non-Euclidean random objects and present comprehensive empirical evidence to support the performance of our proposed methods.

非参数统计度量空间统计推断分布函数