Credit risk prediction is a cornerstone task in financial analytics, enabling lending institutions to evaluatethe probability that a borrower will default on an obligation. As financial data grows in volume andcomplexity, machine learning (ML) algorithms have emerged as powerful alternatives to traditionalstatistical scoring models. This paper presents a systematic empirical comparison of seven classicalsupervised machine learning algorithms — Logistic Regression, Decision Tree, Random Forest,Gradient Boosting, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naive Bayes —applied to a benchmark credit risk dataset. Each algorithm is evaluated across five performance metrics:Accuracy, Precision, Recall, F1-Score, and AUC-ROC. Experimental results demonstrate that ensemblemethods, particularly Gradient Boosting, achieve superior predictive performance (AUC-ROC = 0.941),followed closely by Random Forest (AUC-ROC = 0.923). Logistic Regression remains a strong andhighly interpretable baseline (AUC-ROC = 0.871). This study provides actionable insights for financialpractitioners and data scientists selecting appropriate ML models for credit scoring applications, anddiscusses trade-offs between interpretability, computational cost, and predictive accuracy.
Building similarity graph...
Analyzing shared references across papers
Loading...
Ilgar Nasirov (Sat,) studied this question.
synapsesocial.com/papers/69dc89473afacbeac03eb12e — DOI: https://doi.org/10.5281/zenodo.19511893
Ilgar Nasirov
Azerbaijan State University of Economics
Azerbaijan State University of Economics
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: