What type of study is this?

This is a Quantitative Study study.

September 12, 2025Open Access

Explainable AI-driven depression detection from social media using natural language processing and black box machine learning models

Key Points

SVM achieved the highest accuracy for depression detection from social media data, outperforming other models.
The integration of LIME provided insights into the linguistic markers relevant to psychological research.
This approach not only focused on predictive performance but also highlighted the importance of model interpretability.
Using advanced NLP techniques, the study extracted features crucial for understanding depression from user-generated content.

Abstract

Introduction Mental disorders are highly prevalent in modern society, leading to substantial personal and societal burdens. Among these, depression is one of the most common, often exacerbated by socioeconomic, clinical, and individual risk factors. With the rise of social media, user-generated content offers valuable opportunities for the early detection of mental disorders through computational approaches. Methods This study explores the early detection of depression using black-box machine learning (ML) models, including Support Vector Machines (SVM), Random Forests (RF), Extreme Gradient Boosting (XGB), and Artificial Neural Networks (ANN). Advanced Natural Language Processing (NLP) techniques TF-IDF, Latent Dirichlet Allocation (LDA), N-grams, Bag of Words (BoW), and GloVe embeddings were employed to extract linguistic and semantic features. To address the interpretability limitations of black-box models, Explainable AI (XAI) methods were integrated, specifically the Local Interpretable Model-Agnostic Explanations (LIME). Results Experimental findings demonstrate that SVM achieved the highest accuracy in detecting depression from social media data, outperforming RF and other models. The application of LIME enabled granular insights into model predictions, highlighting linguistic markers strongly aligned with established psychological research. Discussion Unlike most prior studies that focus primarily on classification accuracy, this work emphasizes both predictive performance and interpretability. The integration of LIME not only enhanced transparency and interpretability but also improved the potential clinical trustworthiness of ML-based depression detection models.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Hameed et al. (Thu,) studied this question.

synapsesocial.com/papers/68d44b3831b076d99fa54c82 https://doi.org/https://doi.org/10.3389/frai.2025.1627078

Bookmark

View Full Paper