August 9, 2025

Voice for All: Evaluating the Accuracy and Equity of Automatic Speech Recognition Systems in Transcribing Patient Communications in Home Healthcare.

Puntos clave

Automatic speech recognition systems show reduced transcription accuracy for Black patients compared to White patients.
The word error rate for AWS General was 39%, the highest among the evaluated ASR systems.
Different linguistic domains, such as Affect and Social, posed significant challenges for ASR accuracy.
Diverse training datasets are necessary to enhance equity in automatic speech recognition performance within healthcare.

Resumen

Integrating automatic speech recognition (ASR) systems into home healthcare workflows can enhance risk prediction models. However, ASR systems exhibit disparities in transcription accuracy across racial and linguistic groups, highlighting an equity gap that could bias healthcare delivery. We evaluated four ASR systems-AWS General, AWS Medical, Whisper, and Wave2Vec-in transcribing 860 patient-nurse utterances (475 Black, 385 White). Word error rate (WER) was the primary measure. AWS General achieved the highest accuracy (median WER 39%), but all systems were less accurate for Black patients, particularly in linguistic domains "Affect," "Social," and "Drives." AWS Medical outperformed others on medical terms, although filler words, repetition, and nonmedical terms challenged every system. These findings underscore the need for diverse training datasets and improved dialect sensitivity to ensure equitable ASR performance and robust risk identification in home healthcare.

Me gusta

Guardar

Me gusta

Guardar

Voice for All: Evaluating the Accuracy and Equity of Automatic Speech Recognition Systems in Transcribing Patient Communications in Home Healthcare.

Puntos clave

Resumen

Cite This Study