Should We Use F -Tests for Model Fit Instead of Chi-Square in Overidentified Structural Equation Models?
指出在结构方程模型中,小样本下常用的卡方检验和拟合指数容易过度拒绝正确模型,而F检验在样本量小于200或参数比小于3时具有更好的统计性质,适合替代传统方法。
Debate continues about whether the likelihood ratio test ( T ML ) or goodness-of-fit indices are most appropriate for assessing data-model fit in structural equation models. Though potential advantages and disadvantages of these methods with large samples are often discussed, shortcomings concomitant with smaller samples are not. This article aims to (a) highlight the broader small sample issues with both approaches to data-model fit assessment, (b) note that what constitutes a small sample is common in empirical studies (approximately 20% to 50% in review studies, depending on the definition of “small”), and (c) more widely introduce F-tests as a desirable alternative than the traditional T ML tests, small-sample corrections, or goodness-of-fit indices with smaller samples. Both goodness-of-fit indices and comparing T ML to a chi-square distribution at smaller samples leads to overrejection of well-fitting models. Simulations and example analyses show that F-tests yield more desirable statistical properties—with or without normality—than standard approaches like chi-square tests or goodness-of-fit indices with smaller samples, roughly defined as N < 200 or N: df < 3.