What question did this study set out to answer?

The study aims to develop and evaluate machine learning models for depression detection in both English and Arabic social media text.

April 5, 2026Open Access

Multilingual depression screening via social media: comparative analysis of machine learning models on English and Arabic text

Puntos clave

The study aims to develop and evaluate machine learning models for depression detection in both English and Arabic social media text.
Developed multilingual machine learning models using Arabic and English tweet datasets.
Implemented a preprocessing pipeline including normalization and feature selection.
Applied Bag-of-Words and TF-IDF for feature representation.
Tested classifiers like Random Forest and SVM, addressing class imbalance with SMOTE.
RBF-SVM with TF-IDF outperformed other models with an F1-score of 98% on Arabic tweets.
Achieved an AUC of 0.996 for Arabic and F1-score of 94.2% on English tweets.
High-quality preprocessing and expert annotations significantly improved classification outcomes.

Resumen

Depression is a leading cause of disability worldwide, yet many individuals remain undiagnosed due to stigma, limited access to care, or lack of awareness. The growing use of social media provides a new opportunity for passive mental health screening through natural language processing and machine learning, particularly for low-resource languages such as Arabic that remain underrepresented in the literature. This study develops and evaluates multilingual machine learning models for detecting depression in social media text, using two balanced datasets: an Arabic corpus of 15,000 tweets and an English corpus of 99,590 tweets. The preprocessing pipeline incorporates normalization, negation and intensifier handling, and Chi-Square-based feature selection, with feature representation achieved through Bag-of-Words and TF-IDF. Classifiers including Random Forest, Linear SVM, and RBF-SVM were tested with SMOTE applied to address class imbalance. Results show that the RBF-SVM with TF-IDF consistently outperformed other models, achieving an F1-score of 98% and AUC of 0.996 on Arabic tweets, and an F1-score of 94.2% and AUC of 0.987 on English tweets. These outcomes highlight the impact of high-quality preprocessing, linguistic augmentation, and expert-verified annotations in improving classification performance, particularly for Arabic data. The findings demonstrate that optimized traditional machine learning models can surpass more complex deep learning methods for depression detection, and contribute benchmark datasets and practical methodologies for advancing cross-lingual mental health informatics.

Me gusta

Guardar

Ver artículo completo