A novel approach using explainable prediction of default risk in peer-to-peer lending based on machine learning models

Markus Atef; Shimaa Ouf; Wafaa Seoud; Menna Ibrahim Gabr

doi:https://doi.org/10.1007/s00521-025-11489-8

A novel approach using explainable prediction of default risk in peer-to-peer lending based on machine learning models

dc.Affiliation	October University for modern sciences and Arts MSA
dc.contributor.author	Markus Atef
dc.contributor.author	Shimaa Ouf
dc.contributor.author	Wafaa Seoud
dc.contributor.author	Menna Ibrahim Gabr
dc.date.accessioned	2025-08-11T16:30:18Z
dc.date.issued	2025-08-04
dc.description.abstract	Online peer-to-peer (P2P) lending has expanded substantially during the previous decade globally. However, this quick expansion poses several potential risks as loan default risk in P2P lending remains unavoidable. As P2P lending has grown in both size and complexity, the challenges have also multiplied, leading to several complications, including high number of features, low-performing classification models and imbalanced dataset. Furthermore, machine learning models encounter another challenging issue known as the black-box problem. To overcome these challenges, the present work introduces a novel approach that involves tackling the dataset balancing issue using synthetic minority oversampling technique (SMOTE), employing carefully selected feature selection approaches (maximum relevance minimum redundancy (MRMR), sequential forward selection (SFS) and adaptive boosting (AdaBoost)) and machine learning such as nonlinear model (K-nearest neighbour (KNN)), tree-based model (random forest (RF)) and deep learning (multi-layer perceptron (MLP)). Compared to the previous studies, the present results showed that RF exhibited outstanding performance of 0.94, 0.94 and 0.99 in accuracy, F1-score and AUC, respectively. To address the black-box issue of the prediction model, enhance its interpretability and boost user trust, local interpretable model-agnostic explanations (LIME) and Shapley additive explanations (SHAP) explainable machine learning models were applied to the RF prediction model to elucidate its results. Furthermore, LIME and SHAP explainable machine learning models were applied to the RF prediction model, both with and without SMOTE resampling, to examine the influence of SMOTE resampling on the interpretability analysis of the RF prediction outcomes.
dc.description.uri	https://link.springer.com/journal/521/aims-and-scope
dc.identifier.citation	Atef, M., Ouf, S., Seoud, W., & Gabr, M. I. (2025). A novel approach using explainable prediction of default risk in peer-to-peer lending based on machine learning models. Neural Computing and Applications. https://doi.org/10.1007/s00521-025-11489-8
dc.identifier.doi	https://doi.org/10.1007/s00521-025-11489-8
dc.identifier.other	https://doi.org/10.1007/s00521-025-11489-8
dc.identifier.uri	https://repository.msa.edu.eg/handle/123456789/6486
dc.language.iso	en_US
dc.publisher	Springer nature link
dc.relation.ispartofseries	Neural Computing and Applications; 2025
dc.subject	P2P lending , Loan default risk , Machine learning prediction models , Explainable machine learning models
dc.title	A novel approach using explainable prediction of default risk in peer-to-peer lending based on machine learning models
dc.type	Book chapter

Files

Original bundle

Now showing 1 - 1 of 1

Name:: s00521-025-11489-8.pdf
Size:: 1.97 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 51 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Faculty Of Management Sciences Research Paper