高利害情境下的人格测量：分级配对比较法能否成为传统迫选法的更可靠替代方案？

Measuring Personality When Stakes Are High: Are Graded Paired Comparisons a More Reliable Alternative to Traditional Forced-Choice Methods?

ORGANIZATIONAL RESEARCH METHODS · 2024

被引 3

人大 A-ABS 4

Harriet Lingel 通讯
Paul‐Christian Bürkner
Klaus G. Melchers
Niklas Schulte · 柏林自由大学

中文导读

通过模拟960种条件和实证研究，比较了分级配对比较法与传统迫选法在高利害情境下的人格测量性能，发现优化项目组合可提高信度并降低自比性。

Abstract

In graded paired comparisons (GPCs), two items are compared using a multipoint rating scale. GPCs are expected to reduce faking compared with Likert-type scales and to produce more reliable, less ipsative trait scores than traditional binary forced-choice formats. To investigate the statistical properties of GPCs, we simulated 960 conditions in which we varied six independent factors and additionally implemented conditions with algorithmically optimized item combinations. Using Thurstonian IRT models, good reliabilities and low ipsativity of trait score estimates were achieved for questionnaires with 50% unequally keyed item pairs or equally keyed item pairs with an optimized combination of loadings. However, in conditions with 20% unequally keyed item pairs and equally keyed conditions without optimization, reliabilities were lower with evidence of ipsativity. Overall, more response categories led to higher reliabilities and nearly fully normative trait scores. In an empirical example, we demonstrate the identified mechanisms under both honest and faking conditions and study the effects of social desirability matching on reliability. In sum, our studies inform about the psychometric properties of GPCs under different conditions and make specific recommendations for improving these properties.

心理学心理测量学人格心理学项目反应理论

阅读原文 ↗