Key points are not available for this paper at this time.
Dysarthria, a common symptom of motor neuron disease (MND), is primarily assessed through perceptual evaluations that are subjective, time-consuming, and require expert training. Acoustic analysis provides an objective alternative, leveraging either "global" features extracted from the entire speech signal or "segmental" features derived from individual phonemes (e.g., vowels). While segmental features offer finer-grained acoustic insights, their extraction has traditionally required manual segmentation, making the process time-consuming. This study explored the use of the Montreal Forced Aligner (MFA) for automatic vowel segmentation and compared the diagnostic utility of global versus segmental acoustic features in two machine learning tasks: dysarthria detection (i.e., distinguishing healthy controls (HCs) from speakers with dysarthria) and dysarthria severity classification (i.e., pre-, early-, and late-symptomatic stratification). Speech data were collected from 104 speakers with MND and 99 HCs. Global features were computed from voiced segments, while segmental features were derived from automatically aligned vowels. Four tree-based classifiers - Decision Tree, Random Forest, XGBoost, and LightGBM - were trained using 10-fold cross-validation. Feature importance was assessed using SHAP values, and statistical tests identified features with significant group differences. The MFA achieved alignment accuracy of at least 86% for healthy and early-symptomatic speakers, declining to 72% in late-stage dysarthria. For dysarthria detection, global features were significantly more effective than segmental features only in the XGBoost model. In contrast, segmental features significantly outperformed global features in dysarthria severity classification across all ensemble classifiers. These findings support the use of automated segmental analysis as an objective, viable, and clinically meaningful approach for assessing dysarthria in MNDs.
Attia et al. (Thu,) studied this question.