What question did this study set out to answer?

This study aims to create and validate a machine learning-based system for detecting Parkinson's disease from Arabic speech.

May 8, 2026Open Access

Machine Learning-Based Detection of Parkinson’s Disease From Arabic Speech: A Cross-Linguistic Validation Study

Key Points

This study aims to create and validate a machine learning-based system for detecting Parkinson's disease from Arabic speech.
Developed an Arabic PD speech dataset of 40 subjects (17 with PD and 23 controls).
Tested twelve machine learning classifiers employing feature extraction and dimensionality reduction techniques.
Validated methodology on an independent Spanish cohort of 100 subjects using leave-one-out and k-fold cross-validation.
Achieved 90% accuracy, precision, recall, and F1-score with Linear Discriminant Analysis using leave-one-out cross-validation.
Linear Support Vector Classification reached 87.7% precision and 87.5% recall.
Tested methodology on Spanish dataset showed 83% accuracy, confirming cross-linguistic generalizability.

Abstract

Early detection of Parkinson’s disease (PD) through speech analysis offers significant clinical advantages, yet no validated tools exist for Arabic-speaking populations, representing a critical gap in global healthcare. Previous studies have relied on limited machine learning (ML) classifiers and voice attributes, which may introduce bias and hinder effective technique discovery. To address this, we developed an optimal PD prediction pipeline by testing multiple ML classifiers and feature extraction methods. We created the first Arabic PD speech dataset, comprising 40 subjects (17 with PD and 23 controls), and validated our methodology on an independent Spanish cohort of 100 subjects. Feature extraction included traditional, audio-to-text, and deep voice features from a pre-trained Whisper model. We employed feature selection and dimensionality reduction techniques to refine the dataset dimensions. Final features were assessed using twelve classifiers with leave-one-out and k-fold cross-validation for robust performance evaluation. Shapley additive explanations (SHAP) were utilized to determine feature importance as vocal biomarkers. Linear Discriminant Analysis achieved optimal performance with 90% accuracy, precision, recall, and F1-score using leave-one-out cross-validation. Linear Support Vector Classification also performed well, achieving 87.7% precision and 87.5% recall. When tested on the independent Spanish dataset, our methodology attained 83% accuracy, confirming cross-linguistic generalizability. SHAP analysis indicated that audio-to-text features provide contextual insights on fluency and coherence, while traditional features effectively capture acoustic variations. This study establishes the first validated Arabic PD speech classification system and demonstrates its universal applicability, laying the groundwork for global speech-based PD screening.

Machine Learning-Based Detection of Parkinson’s Disease From Arabic Speech: A Cross-Linguistic Validation Study

Key Points

Abstract

Cite This Study