What does this research mean for the field?

Character-level 3-grams combined with Logistic Regression achieve the best performance for SMS spam detection, reaching 98.55% accuracy and 98.55% precision. Novelty: ClaimNovelty.CONFIRMATORY. Consensus alignment: ConsensusAlignment.SUPPORTS_CONSENSUS.

What question did this study set out to answer?

This research aims to evaluate how different feature representation techniques affect the accuracy of SMS spam detection using classical machine learning models.

February 24, 2026Open Access

Revisiting SMS Spam Detection: The Impact of Feature Representation on Classical Machine Learning Models

Key Points

This research aims to evaluate how different feature representation techniques affect the accuracy of SMS spam detection using classical machine learning models.
Evaluated seven feature representation techniques with six machine learning classifiers.
Used the SMSSpamCollection dataset containing 5574 SMS messages.
Assessed 42 configurations with 10-fold cross-validation.
Emphasized precision and specificity to minimize false positives.
Analyzed feature-classifier interaction patterns.
Achieved 98.55% accuracy with character-level 3-grams and Logistic Regression.
Obtained 98.55% precision and 90.50% recall for the spam class.
Linear SVM also performed well, suggesting effectiveness of linear models with expressive features.
Demonstrated the crucial impact of feature representation over classifier complexity.

Abstract

The proliferation of unsolicited short messages (SMS spam) poses persistent challenges to mobile communication security and user privacy. This study presents a systematic benchmarking and analytical investigation of classical machine learning approaches for SMS spam detection, focusing on the impact of text feature representation under imbalanced short-text conditions.In practical SMS filtering systems, minimizing false positives (i.e., incorrectly blocking legitimate messages) is a critical operational constraint. Therefore, beyond overall accuracy, precision and specificity are emphasized to ensure reliable preservation of legitimate communication. Using the SMSSpamCollection dataset (5574 messages: 747 spam and 4827 ham), seven feature representation techniques were evaluated in combination with six widely adopted classifiers, resulting in 42 configurations assessed under 10-fold cross-validation. The results demonstrate that feature representation plays a more critical role than classifier complexity. Character-level 3-grams combined with Logistic Regression achieved the best overall performance, reaching 98.55% accuracy, with 98.55% precision and 90.50% recall for the spam class (F1-score = 94.32%), and 0.9893 AUC. Linear SVM produced comparable results, highlighting the effectiveness of linear models when paired with expressive representations. Beyond reporting performance metrics, this study analyzes feature–classifier interaction patterns and clarifies practical trade-offs between precision, recall, and computational efficiency. The findings provide reproducible baselines and structured guidance for designing efficient SMS spam filtering systems.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Soysaldı et al. (Sat,) studied this question.

synapsesocial.com/papers/699d3fe6de8e28729cf64b55 https://doi.org/https://doi.org/10.3390/electronics15040894

Bookmark

View Full Paper