What question did this study set out to answer?

The study aims to improve churn prediction accuracy by proposing an enhanced feature selection method called GRAS.

March 16, 2026Open Access

An enhanced Gravitational Search Algorithm for feature selection in telecom churn prediction

Key Points

The study aims to improve churn prediction accuracy by proposing an enhanced feature selection method called GRAS.
Developed GRAS by integrating Gravitational Search Algorithm and Simulated Annealing.
Evaluated GRAS using two classifiers: KNN and Random Forest on four public telecom datasets.
Compared GRAS's performance with Genetic Algorithm, standalone Gravitational Search Algorithm, and Simulated Annealing.
Conducted Friedman-Nemenyi tests for statistical validation of results.
GRAS outperformed GA, GSA, and SA in feature selection and speed.
Achieved the best accuracy, precision, recall, F1, and AUC with KNN on the largest dataset.
GRAS consistently selected the smallest feature subsets while maintaining model interpretability.
SHAP analysis showed 56.7% overlap between GRAS-selected and top-ranked features.

Abstract

• An enhanced feature selection method (GRAS) is proposed for churn prediction. • This method combines Gravitational Search Algorithm with Simulated Annealing. • Benchmarked with KNN and RF on four public telecom datasets, it outperforms GA, GSA, and SA. • GRAS selects smaller feature sets and runs faster than GA/GSA; results are Friedman-Nemenyi validated. • SHAP analysis aligns with GRAS-selected features, supporting interpretability and scalability. Customer churn poses a significant challenge to telecommunications companies, as retaining existing customers is more cost-effective than acquiring new ones. Accurate churn prediction enables timely interventions, reduces revenue loss, and enhances customer satisfaction. We propose GRAS, a feature selection method that integrates the global exploration of the Gravitational Search Algorithm (GSA) with the local refinement of Simulated Annealing (SA), aiming to improve the efficiency and interpretability. We evaluate GRAS with two base learners, k -Nearest Neighbors (KNN) and Random Forest (RF), on four publicly available churn datasets and benchmark its performance against metaheuristic baselines: Genetic Algorithm (GA), standalone GSA, and SA. The largest dataset contains 58 features and 51,047 samples and is included to stress-test scalability. The other three are smaller, with 11–21 features and 3,333–7,043 samples. On the largest dataset, GRAS with KNN attains the best Accuracy, Precision, Recall, F1, and AUC among GA, GSA, and SA. With RF, GRAS remains competitive across datasets and consistently selects the smallest feature subsets (lowest OFS), yielding compact and interpretable models. GRAS is markedly faster than GSA and GA, though slower than single-trajectory SA. These differences are confirmed by the Friedman test with Nemenyi post-hoc analysis. To support transparency, we conduct a SHAP-based analysis with OFS-matched cutoffs and observe an average 56.7% overlap (range 44.4%–69.2%) between GRAS-selected features and top-ranked SHAP features. Overall, GRAS shows scalable, statistically validated performance and selects business-relevant features, making it a practical choice for churn management pipelines where predictive quality and explainability are required.

Read Full Paperexternally

AI에게 질문

Bookmark

View Full Paper

Cite This Study

Hendro et al. (Sun,) studied this question.

synapsesocial.com/papers/69b79df38166e15b153ab197 https://doi.org/https://doi.org/10.1016/j.rineng.2026.109989

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

AI에게 질문

Bookmark

View Full Paper