监督机器学习在预测人力资源管理结果中的分类性能:一项使用交叉分类多层模型的元分析

Classification Performance of Supervised Machine Learning to Predict Human Resource Management Outcomes: A Meta‐Analysis Using Cross‐Classified Multilevel Modeling

HUMAN RESOURCE MANAGEMENT · 2025
被引 1
人大 AFT50

中文导读

通过元分析249项研究中的6605个效应量,评估了监督机器学习模型在人力资源管理中的分类性能,发现提升法和随机森林表现最佳但计算成本高,决策树则兼具低成本和可解释性。

Abstract

ABSTRACT Using signal detection theory, we meta‐analyzed existing research testing the classification performance of supervised machine learning (ML) models applied in human resource (HR) contexts. Our meta‐analysis contained 6605 effect sizes cross‐nested by study ( N 1 = 249) and unique dataset ( N 2 = 152). We conducted separate cross‐classified multilevel modeling analyses predicting six different classification performance indices. We tested hypotheses regarding the effects of algorithm type, sample size, number of predictors used, the number of outcome classes, and outcome class imbalance on model classification performance. Boosting and random forest algorithms performed best across classification performance indices. However, both come at relatively great computational expense, and solutions can be difficult to explain. Decision trees were the best‐performing algorithms with relatively lower computational expense and easily interpretable solutions. We found limited evidence for the effect of sample size on classification performance. We found stronger support suggesting that using more predictors to train machine learning models results in better classification performance, but only when those predictors have substantive value. Finally, we found strong evidence that outcome class imbalance influences classification performance indices differently. More imbalanced outcome classes are associated with higher accuracy scores, which can be misleading, and lower precision and F1 scores, which may be better estimates of true classification performance. Our findings provide valuable insights into developing ML tools better suited to support HR functions.

人力资源管理机器学习元分析分类性能