Key points are not available for this paper at this time.
Automatic Speech Recognition (ASR) often faces challenges in processing children's speech due to data scarcity. Training large ASR models becomes particularly challenging in such scenarios. To mitigate this issue, fine-tuning is commonly employed, leveraging pre-trained adult models. However, fine-tuning large pre-trained models with limited data poses its own challenges. In response, this study investigates Parameter-Efficient Finetuning (PEFT) for children's ASR. Various PEFT approaches are explored, with a specific emphasis on good ASR performance while minimising the number of parameters during training. Our investigation identifies residual Adapters as the most efficient technique. Moreover, motivated by Transformer-based model redundancies, we propose the Shared-Adapter and its highly parameter-efficient variant, the Light Shared-Adapter. Our findings demonstrate that Shared-Adapters strike an exceptional balance between recognition performance and parameter efficiency.
Building similarity graph...
Analyzing shared references across papers
Loading...
Thomas Rolland
Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento
Alberto Abad
Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento
Building similarity graph...
Analyzing shared references across papers
Loading...
Rolland et al. (Sun,) studied this question.
synapsesocial.com/papers/68e59e92b6db643587538ac4 — DOI: https://doi.org/10.21437/interspeech.2024-1105
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: