关于检验一组对象聚类质量的程序说明

A NOTE ON PROCEDURES FOR TESTING THE QUALITY OF A CLUSTERING OF A SET OF OBJECTS

DECISION SCIENCES · 1980
被引 53
人大 AABS 3

中文导读

指出已有聚类质量检验程序未能指定有意义的抽样分布,并提出基于点二列相关的指标作为恢复度量,进而构建有效的统计检验方法。

Abstract

Abstract Despite the increased application of cluster analysis in decision sciences, few attempts have been made to derive hypothesis‐testing procedures for the evaluation of clustering solutions. In fact, the present paper shows that at least one such attempt failed to specify a meaningful sampling distribution for the test procedure. An alternative index based on the concept of point‐biserial correlation is proposed as a possible recovery measure. The index is subsequently used to form the basis of a valid statistical test for the existence of cluster structure.

聚类分析统计检验决策科学数据挖掘