What question did this study set out to answer?

The aim is to create a machine learning-based system that automates loan eligibility assessments to enhance efficiency, accuracy, and fairness.

May 30, 2026Open Access

Machine Learning‐Based Loan Approval Automation: Enhancing Efficiency, Accuracy and Fairness in Credit Decision‐Making

Key Points

The aim is to create a machine learning-based system that automates loan eligibility assessments to enhance efficiency, accuracy, and fairness.
Developed and compared multiple supervised ML models including Random Forest and XGBoost.
Applied SMOTE for class imbalance, using GridSearchCV for hyperparameter tuning with 5-fold cross-validation.
Evaluated model performance with metrics such as accuracy, precision, recall, F1-score, and AUC.
Tuned Random Forest achieved 85.96% accuracy, 87.17% F1-score, and 95.32% recall.
Cross-dataset validation showed comparable AUC values (0.90 vs. 0.91), confirming model generalisability.
All models exceeded the 0.80 disparate impact threshold in gender-based fairness evaluations.

Abstract

ABSTRACT Traditional loan approval processes are manual, time‐consuming and susceptible to human bias. This research develops a machine learning‐based system to automate loan eligibility assessment while enhancing efficiency, accuracy and fairness in credit decision‐making. We developed and compared multiple supervised ML models—including Random Forest, XGBoost, stacking and voting ensembles—on a publicly available loan dataset (614 instances, 13 features). To address class imbalance (68.7% approved, 31.3% rejected), we applied the Synthetic Minority Over‐sampling Technique (SMOTE). Hyperparameter tuning was performed using GridSearchCV with 5‐fold cross‐validation, optimising for F1‐score. Model performance was evaluated using accuracy, precision, recall, F1‐score and Area Under the Curve (AUC). Both Local Interpretable Model‐Agnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP) were applied to ensure transparency. A tuned Random Forest classifier achieved the best performance with an accuracy of 85.96%, F1‐score of 87.17% and recall of 95.32%, outperforming XGBoost and ensemble methods. Cross‐dataset validation on the Statlog German Credit dataset (1000 instances, 20 features) confirmed the framework's generalisability, with comparable AUC values (0.90 vs. 0.91). Overfitting analysis confirms moderate generalisation gaps, and a gender‐based fairness evaluation shows all models exceed the 0.80 disparate impact threshold. This study demonstrates that a systematic framework combining rigorous hyperparameter tuning, class imbalance handling and explainable AI can create accurate, transparent and equitable loan approval systems, providing a practical blueprint for responsible AI deployment in credit decision‐making.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper