STATISTICAL POWER OF TRAINING EVALUATION DESIGNS
研究了后测仅、增益分数和协方差分析三种培训评估设计达到不同统计功效所需的样本量,发现设计类型、前后测相关性和培训效果大小共同决定功效曲线,并基于典型样本量给出了设计建议。
Sample size requirements needed to achieve various levels of statistical power using posttest‐only, gain‐score, and analysis of covariance designs in evaluating training interventions have been developed. Results are presented which indicate that the power to detect true effects differs according to the type of design, the correlation between the pre‐ and posttest, and the size of the effect due to the training program. We show that the type of design and correlations between the pre‐ and posttest complexly determine the power curve. Finally, an estimate of typical sample sizes used in training evaluation design has been determined and reviewed to determine the power of the various designs to detect true effects, given this sample‐size specification. Recommendations for type of design are provided based on sample size and projected correlations between pre‐ and posttest scores.