September 1, 2024

Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech

Abstract

Project Euphonia, a Google initiative, is dedicated to improving automatic speech recognition (ASR) of disordered speech. A central objective of the project is to create a large, high-quality, and diverse speech corpus. This report describes the project's latest advancements in data collection and annotation methodologies, such as expanding speaker diversity in the database, adding human-reviewed transcript corrections and audio quality tags to 350K (of the 1.2M total) audio recordings, and amassing a comprehensive set of metadata (including more than 40 speech characteristic labels) for over 75% of the speakers in the database. We report on the impact of transcript corrections on our machine-learning (ML) research, inter-rater variability of assessments of disordered speech patterns, and our rationale for gathering speech metadata. We also consider the limitations of using automated off-the-shelf annotation methods for assessing disordered speech.

Authors

Panpan Jiang

Jimmy Tobin

Katrin Tomanek
