A novel approach using explainable prediction of default risk in peer-to-peer lending based on machine learning models

dc.AffiliationOctober University for modern sciences and Arts MSA
dc.contributor.authorMarkus Atef
dc.contributor.authorShimaa Ouf
dc.contributor.authorWafaa Seoud
dc.contributor.authorMenna Ibrahim Gabr
dc.date.accessioned2025-08-11T16:30:18Z
dc.date.issued2025-08-04
dc.description.abstractOnline peer-to-peer (P2P) lending has expanded substantially during the previous decade globally. However, this quick expansion poses several potential risks as loan default risk in P2P lending remains unavoidable. As P2P lending has grown in both size and complexity, the challenges have also multiplied, leading to several complications, including high number of features, low-performing classification models and imbalanced dataset. Furthermore, machine learning models encounter another challenging issue known as the black-box problem. To overcome these challenges, the present work introduces a novel approach that involves tackling the dataset balancing issue using synthetic minority oversampling technique (SMOTE), employing carefully selected feature selection approaches (maximum relevance minimum redundancy (MRMR), sequential forward selection (SFS) and adaptive boosting (AdaBoost)) and machine learning such as nonlinear model (K-nearest neighbour (KNN)), tree-based model (random forest (RF)) and deep learning (multi-layer perceptron (MLP)). Compared to the previous studies, the present results showed that RF exhibited outstanding performance of 0.94, 0.94 and 0.99 in accuracy, F1-score and AUC, respectively. To address the black-box issue of the prediction model, enhance its interpretability and boost user trust, local interpretable model-agnostic explanations (LIME) and Shapley additive explanations (SHAP) explainable machine learning models were applied to the RF prediction model to elucidate its results. Furthermore, LIME and SHAP explainable machine learning models were applied to the RF prediction model, both with and without SMOTE resampling, to examine the influence of SMOTE resampling on the interpretability analysis of the RF prediction outcomes.
dc.description.urihttps://link.springer.com/journal/521/aims-and-scope
dc.identifier.citationAtef, M., Ouf, S., Seoud, W., & Gabr, M. I. (2025). A novel approach using explainable prediction of default risk in peer-to-peer lending based on machine learning models. Neural Computing and Applications. https://doi.org/10.1007/s00521-025-11489-8
dc.identifier.doihttps://doi.org/10.1007/s00521-025-11489-8
dc.identifier.otherhttps://doi.org/10.1007/s00521-025-11489-8
dc.identifier.urihttps://repository.msa.edu.eg/handle/123456789/6486
dc.language.isoen_US
dc.publisherSpringer nature link
dc.relation.ispartofseriesNeural Computing and Applications; 2025
dc.subjectP2P lending , Loan default risk , Machine learning prediction models , Explainable machine learning models
dc.titleA novel approach using explainable prediction of default risk in peer-to-peer lending based on machine learning models
dc.typeBook chapter

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
s00521-025-11489-8.pdf
Size:
1.97 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
51 B
Format:
Item-specific license agreed upon to submission
Description: