March 18, 2024 (Open Access)

Exploring Adapters with Conformers for Children’s Automatic Speech Recognition

Abstract

The high variability in the acoustic, pronunciation, and linguistic characteristics of children's speech makes children's automatic speech recognition (ASR) a complex task. Training a dedicated ASR model from scratch for children remains challenging, mainly because little children's speech data is available. A common strategy to tackle this limitation is to fine-tune a pre-trained ASR model. However, fine-tuning is itself hampered by speaker diversity and data scarcity, especially for large ASR models such as the Conformer. In this study, we explore an alternative approach known as Adapter transfer, which requires training fewer parameters and can be more effective in adapting large ASR models to children's speech. We assess various Adapter configurations from the literature and introduce a novel configuration called Two Serial Adapter (TSA). The experimental results indicate that Adapter transfer consistently outperforms traditional fine-tuning across various configurations for the Conformer model.
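To make the "fewer parameters" point concrete, the following is a minimal sketch of a generic residual bottleneck adapter, assuming the common down-projection, nonlinearity, up-projection design. It is not the paper's TSA configuration, whose layout is not described on this page. The function names and the dimensions (256, 32) are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: a generic bottleneck adapter, NOT the paper's TSA.
# All names and dimensions are assumptions made for this example.

def linear_params(d_in: int, d_out: int) -> int:
    """Number of weights plus biases in one dense layer."""
    return d_in * d_out + d_out


def adapter_params(d_model: int, bottleneck: int) -> int:
    """Trainable parameters of one adapter: down-projection + up-projection."""
    return linear_params(d_model, bottleneck) + linear_params(bottleneck, d_model)


def adapter_forward(x, w_down, b_down, w_up, b_up):
    """Apply a residual bottleneck adapter to one feature vector.

    x      : list of d_model floats
    w_down : bottleneck rows, each of length d_model
    w_up   : d_model rows, each of length bottleneck
    """
    # Down-project and apply ReLU.
    hidden = [max(0.0, sum(xi * w for xi, w in zip(x, row)) + b)
              for row, b in zip(w_down, b_down)]
    # Up-project back to the model dimension.
    update = [sum(hi * w for hi, w in zip(hidden, row)) + b
              for row, b in zip(w_up, b_up)]
    # Residual connection: the adapter only adds a correction to x.
    return [xi + ui for xi, ui in zip(x, update)]


if __name__ == "__main__":
    # Assumed sizes: model dimension 256, bottleneck 32.
    print(adapter_params(256, 32))
```

In this kind of setup the pre-trained weights are typically kept frozen and only the adapter weights are updated. With the up-projection initialised to zero, the adapter starts as an identity mapping, so the adapted model initially behaves like the pre-trained one.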

Authors

Thomas Rolland

Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento

Alberto Abad

Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento

Institutions

Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento
