🌙

基于正则化对角距离度量学习的信用评估特征选择与分组效应分析代码和数据仓库

Code and Data Repository for Feature Selection and Grouping Effect Analysis for Credit Evaluation via Regularized Diagonal Distance Metric Learning

INFORMS journal on computing · 2024
被引 3
人大 BUTD24ABS 3

中文导读

该仓库实现了正则化对角距离度量学习模型,通过特征缩放改进距离度量、选择特征并分析分组效应,适用于大规模金融数据信用评估,不追求线性可分性并考虑特征相关性。

Abstract

This repository provides an implementation of the regularized diagonal Distance Metric Learning (DML) model, which improves distance metrics, selects features, and conducts grouping effect analysis by rescaling the features. One characteristic of the proposed model is that it does not pursue linear separability, which is highly unrealistic in financial data. Another characteristic of the proposed model is that it considers correlated features when conducting feature selection, and thus, does not neglect important credit risk sources when used for credit evaluation. The implementation of the solver based on the Alternating Direction Method of Multipliers(ADMM) makes it suitable for large-scale financial applications. The repository also provides the scripts, data, and experimental results reported in the paper. This repository includes four folders, src, scripts, data, and results.

信用评估特征选择距离度量学习金融数据挖掘机器学习