交叉分类过度离散多项数据的分层模型

Hierarchical Models for Cross-Classified Overdispersed Multinomial Data

Journal of Business & Economic Statistics · 1991
被引 17
人大 AABS 4

中文导读

扩展了广义多项模型,通过Dirichlet分布处理过度离散问题,提出广义Dirichlet-多项模型,适用于重复测量或聚类数据,并应用于绩效评估和集成电路良率分析。

Abstract

When a vector of sample proportions is not obtained through a simple random sampling, the covariance matrix for the sample vector can differ substantially from the one corresponding to the multinomial model (Wilson 1989). For example, clustering effects of subject effects in repeated-measure experiments can cause the variance of the observed proportions to be much larger than variances under the multinomial model. The phenomenon is generally referred to as overdispersion. Tallis (1962) proposed a model for identically distributed multinomials with a common measure of correlation and referred to it as the generalized multinomial model. This generalized multinomial model is extended in this article to account for overdispersion by allowing the vectors of proportions to vary according to a Dirichlet distribution. The generalized Dirichlet-multinomial model (as it is referred to here) allows for a second order of pairwise correlation among units, a type of assumption found reasonable in some biological data (Kupper and Haseman 1978) and introduced here to business data. An alternative derivation allowing for two kinds of variation is also considered. Asymptotic normal properties of parameter estimators are used to construct Wald statistics for testing hypotheses. The methods are illustrated with applications to performance evaluation monthly data and an integrated circuit yield analysis.

分层模型交叉分类过度分散多项数据