Key points are not available for this paper at this time.
This paper introduces the USPDATRO dataset. This is a speech dataset, in the Romanian language, constructed from open data, focusing on under-represented voice types (children, young and old people, and female voices). The paper covers the methodology behind the dataset construction, specific details regarding the dataset, and evaluation of existing Romanian Automatic Speech Recognition (ASR) systems, with different architectures. Results indicate that more under-represented speech content is needed in the training of ASR systems. Our approach can be extended to other low-resourced languages, as long as open data are available.
Building similarity graph...
Analyzing shared references across papers
Loading...
Vasile Păiș
Artificial Intelligence Research Institute
Verginica Barbu Mititelu
Academy of Romanian Scientists
Elena Irimia
Artificial Intelligence Research Institute
Applied Sciences
Romanian Academy
Building similarity graph...
Analyzing shared references across papers
Loading...
Păiş et al. (Mon,) studied this question.
synapsesocial.com/papers/68e55b65e2b3180350ef935b — DOI: https://doi.org/10.3390/app14199043