🌙

基于朗之万动力学的贝叶斯违约概率模型

Bayesian probability of default models with Langevin dynamics

Quantitative Finance · 2025
被引 0
人大 BABS 3

中文导读

提出两种基于随机梯度朗之万动力学的机器学习架构,用于估计贷款申请人的违约概率,兼顾计算效率与贝叶斯方法的优势,并在四个零售贷款数据集上进行了基准测试。

Abstract

Using machine learning to estimate the probability of default in credit risk is becoming a popular approach. Bayesian methodologies offer a probabilistic interpretation of model predictions and prevent overfitting, a significant flaw in numerous machine learning models. However, Bayesian inference based on Monte Carlo Markov Chain (MCMC) algorithms comes with high computational costs. For credit scoring models efficiency and performance are equally important features. We propose two machine learning architectures based on Stochastic Gradient Langevin Dynamics (SGLD) to estimate the probability of default of loan applicants. This framework (i) allows us to sample from the true posterior without relying on typical MCMC algorithms, (ii) it is not computationally expensive and (iii) it leverages the strength of Bayesian approaches, such as the flexibility to regularization. We apply this method to Bayesian Logistic Regression and Bayesian Neural Network. Furthermore, we perform a benchmarking analysis with different models and regularization techniques on four large retail loan datasets. We also address model explainability with the model-agnostic method of Shapley Additive Explanation (SHAP).

信用风险机器学习贝叶斯统计违约概率预测