Discriminant Analysis Based on Binary and Continuous Variables
提出一种双判别函数,用于对同时包含二元和连续变量的观测进行分类,并引入双逆抽样方案保证样本协方差矩阵非奇异,给出了样本双判别函数的渐近展开。
Abstract An observation consisting of both binary and continuous variables may be classified into one of two populations by the double-discriminant function based on the point-biserial model. When the parameters are unknown or partially known, a sample double-discriminant function is obtained by replacing the unknown parameters by their sample estimates. A sampling scheme referred to as the double inverse sampling is proposed to ensure nonsingularity of the sample covariance matrices. An asymptotic expansion for the distribution of the sample double-discriminant function is given under the double inverse sampling scheme. Comparisons of three classification procedures—double-discriminant function, X-out procedure, and X-continuous procedure—are made.