A Multiobjective Genetic Algorithm-Based Unilateral Rule Extraction Model for Credit Risk Classification
提出一种基于多目标遗传算法的两阶段单边规则提取方法,解决信用风险分类中稀疏类别特征和模型可解释性问题,在保证高准确率的同时提取简洁易懂的分类规则。
This paper proposes a two-phase unilateral rule extraction method based on a multiobjective genetic algorithm (URE-MOGA) to solve the challenges of sparse categorical features and model interpretability in credit risk classification tasks. In this method, rule encoding is employed to represent the sparse categorical features without increasing data dimensionality. The Pareto dominance theory is utilized to balance the trade-off between model accuracy and interpretability. Additionally, a diversity function and unilateral rule learning strategy are designed to facilitate the acquisition of reasonable rules and classifiers by the URE-MOGA model. Furthermore, three publicly available datasets, along with six derived datasets, are used to validate the feasibility of the proposed URE-MOGA model. Experimental results demonstrate that the proposed model outperforms baseline methods in terms of accuracy, interpretability, and robustness. The adoption of a multiobjective approach enables the URE-MOGA model to extract more concise and comprehensible classification rules while ensuring high accuracy levels. Moreover, both the unilateral rule learning strategy and diversity function play vital roles in enhancing classifier accuracy and robustness within the URE-MOGA model. Overall, the proposed URE-MOGA model provides a novel insight into categorical feature sparsity handling interpretable modeling for credit risk classification tasks.