Key points are not available for this paper at this time.
Detecting depression is a critical component of mental health diagnosis, and accurate assessment is essential for effective treatment. This study introduces a novel, fully automated approach to predicting depression severity using the E-DAIC dataset. We employ Large Language Models (LLMs) to extract depression-related indicators from interview transcripts, utilizing the Patient Health Questionnaire-8 (PHQ-8) score to train the prediction model. Additionally, facial data extracted from video frames is integrated with textual data to create a multimodal model for depression severity prediction. We evaluate three approaches: text-based features, facial features, and a combination of both. Our findings show the best results are achieved by enhancing text data with speech quality assessment, with a mean absolute error of 2.85 and root mean square error of 4.02. This study underscores the potential of automated depression detection, showing text-only models as robust and effective while paving the way for multimodal analysis.
Building similarity graph...
Analyzing shared references across papers
Loading...
Misha Sadeghi
Robert Richer
Bernhard Egger
SHILAP Revista de lepidopterología
npj Mental Health Research
Friedrich-Alexander-Universität Erlangen-Nürnberg
Helmholtz Zentrum München
Building similarity graph...
Analyzing shared references across papers
Loading...
Sadeghi et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69dba56e74ec1634218361e3 — DOI: https://doi.org/10.1038/s44184-024-00112-8