🌙

无限不平衡下的线性分类器

Linear Classifiers Under Infinite Imbalance

Operations Research · 2023
被引 1
人大 AFT50UTD24ABS 4*

中文导读

研究了数据严重不平衡时逻辑回归等线性分类器的极限行为,推导出系数向量的极限表达式,区分了亚指数与指数权重函数,并通过数值例子和信用风险案例展示了如何调整分类器以提升预测的特异性或敏感性。

Abstract

Understanding Linear Classifiers in the Face of Severe Data Imbalance In “Linear Classifiers Under Infinite Imbalance,” Paul Glasserman and Mike Li tackle the challenge of binary classification when data are severely imbalanced—a common dilemma in fields like healthcare and finance. They build upon the work of Owen by examining the behavior of logistic regression and extending the analysis to a broader class of linear discriminant functions. Their key contribution is the proof of infinite-imbalance limits for these functions’ coefficient vectors, providing explicit expressions for these limits and distinguishing between classifiers with subexponential and exponential weight functions. This distinction allows for a better understanding of how to adjust classifiers in the context of extreme imbalance, ultimately leading to improved specificity or sensitivity in predictions. The authors also link their findings to the concepts of robustness and conservatism in classification decisions, offering insight into optimal classifier design against the most challenging alternatives. The practical implications of their theoretical work are illustrated through numerical examples and a credit risk case study, offering a new perspective on managing classification tasks in the face of infinite imbalance.

机器学习二分类数据不平衡线性判别分析逻辑回归