A NOTE ON PROCEDURES FOR TESTING THE QUALITY OF A CLUSTERING OF A SET OF OBJECTS
指出已有聚类质量检验程序未能指定有意义的抽样分布,并提出基于点二列相关的指标作为恢复度量,进而构建有效的统计检验方法。
Abstract Despite the increased application of cluster analysis in decision sciences, few attempts have been made to derive hypothesis‐testing procedures for the evaluation of clustering solutions. In fact, the present paper shows that at least one such attempt failed to specify a meaningful sampling distribution for the test procedure. An alternative index based on the concept of point‐biserial correlation is proposed as a possible recovery measure. The index is subsequently used to form the basis of a valid statistical test for the existence of cluster structure.