主题专家对陈述极端性的评分能否用于简化单维配对偏好量表的开发？

Can Subject Matter Experts’ Ratings of Statement Extremity Be Used to Streamline the Development of Unidimensional Pairwise Preference Scales?

ORGANIZATIONAL RESEARCH METHODS · 2010

被引 19

人大 A-ABS 4

Stephen Stark · 南佛罗里达大学通讯
Oleksandr S. Chernyshenko · 南洋理工大学
Nigel Guenole

中文导读

研究了在计算机自适应测试中，能否用主题专家对陈述极端性的评分替代边际最大似然估计来简化量表开发，发现专家评分误差对分数准确性和效度影响很小。

Abstract

Interest in on-demand noncognitive assessment has flourished due to advances in computer technology and studies demonstrating noteworthy predictive validities for organizational outcomes. Computerized adaptive testing (CAT) based on the Zinnes-Griggs (ZG) ideal point item response theory (IRT) model may hold promise for organizational settings, because a large pool of items can be created from a modest number of stimuli, and the items have been shown to be resistant to some types of rater bias. However, sample sizes needed for marginal maximum likelihood (MML) estimation of statement parameters are quite large and could thus limit usefulness in practice. This article addresses that concern and its ramifications for CAT. Specifically, we conducted empirical and simulation studies to examine whether subject matter expert (SME) ratings of statement extremity (location) can be substituted for MML estimates to streamline test development and launch. Results showed that error in SME-based location estimates had little detrimental effect on score accuracy or validity, regardless of whether measures were constructed adaptively or nonadaptively. Implications for research involving small samples and CAT in field settings are discussed.

心理测量学计算机自适应测试项目反应理论组织心理学非认知评估

阅读原文 ↗