Credit risk prediction is a cornerstone task in financial analytics, enabling lending institutions to evaluatethe probability that a borrower will default on an obligation. As financial data grows in volume andcomplexity, machine learning (ML) algorithms have emerged as powerful alternatives to traditionalstatistical scoring models. This paper presents a systematic empirical comparison of seven classicalsupervised machine learning algorithms — Logistic Regression, Decision Tree, Random Forest,Gradient Boosting, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naive Bayes —applied to a benchmark credit risk dataset. Each algorithm is evaluated across five performance metrics:Accuracy, Precision, Recall, F1-Score, and AUC-ROC. Experimental results demonstrate that ensemblemethods, particularly Gradient Boosting, achieve superior predictive performance (AUC-ROC = 0.941),followed closely by Random Forest (AUC-ROC = 0.923). Logistic Regression remains a strong andhighly interpretable baseline (AUC-ROC = 0.871). This study provides actionable insights for financialpractitioners and data scientists selecting appropriate ML models for credit scoring applications, anddiscusses trade-offs between interpretability, computational cost, and predictive accuracy.
Building similarity graph...
Analyzing shared references across papers
Loading...
Ilgar Nasirov
Azerbaijan State University of Economics
Azerbaijan State University of Economics
Building similarity graph...
Analyzing shared references across papers
Loading...
Ilgar Nasirov (Sat,) studied this question.
synapsesocial.com/papers/69dc89473afacbeac03eb12e — DOI: https://doi.org/10.5281/zenodo.19511893
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: