Key points are not available for this paper at this time.
The study aims to determine a credit default prediction model using data from LendingClub. The model estimates the effect of the influential variables on the prediction process of paid and unpaid loans. We implemented the random forest algorithm to identify the variables with the most significant influence on payment or default, addressing nine predictors related to the borrower's credit and payment background. Results confirm that the model’s performance generates a F1 Macro Score that accomplishes 90% in accuracy for the evaluation sample. Contributions of this study include using the complete dataset of the entire operation of LendingClub available, to obtain transcendental variables for the classification and prediction task, which can be helpful to estimate the default in the person-to-person loan market. We can draw two important conclusions, first we confirm the Random Forest algorithm's capacity to predict binary classification problems based on performance metrics obtained and second, we denote the influence of traditional credit scoring variables on default prediction problems.
Building similarity graph...
Analyzing shared references across papers
Loading...
José Antonio Núñez Mora
University of Alicante
Pamela Moncayo
Tecnológico de Monterrey
Carlos Franco
Universidad Nacional de Colombia
Revista Mexicana de Economía y Finanzas
Tecnológico de Monterrey
Universidad Anáhuac
Building similarity graph...
Analyzing shared references across papers
Loading...
Mora et al. (Tue,) studied this question.
synapsesocial.com/papers/6a0fff53e6385ae0c9fcb9ed — DOI: https://doi.org/10.21919/remef.v18i3.886