September 1, 2024

Shared-Adapters: A Novel Transformer-based Parameter Efficient Transfer Learning Approach For Children’s Automatic Speech Recognition

TRThomas RollandInstituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento AAAlberto AbadInstituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento

Key Points

Key points are not available for this paper at this time.

Abstract

Automatic Speech Recognition (ASR) often faces challenges in processing children's speech due to data scarcity. Training large ASR models becomes particularly challenging in such scenarios. To mitigate this issue, fine-tuning is commonly employed, leveraging pre-trained adult models. However, fine-tuning large pre-trained models with limited data poses its own challenges. In response, this study investigates Parameter-Efficient Finetuning (PEFT) for children's ASR. Various PEFT approaches are explored, with a specific emphasis on good ASR performance while minimising the number of parameters during training. Our investigation identifies residual Adapters as the most efficient technique. Moreover, motivated by Transformer-based model redundancies, we propose the Shared-Adapter and its highly parameter-efficient variant, the Light Shared-Adapter. Our findings demonstrate that Shared-Adapters strike an exceptional balance between recognition performance and parameter efficiency.

问 AI

Bookmark

View Full Paper

Cite This Study

Rolland et al. (Sun,) studied this question.

synapsesocial.com/papers/68e59e92b6db643587538ac4 https://doi.org/https://doi.org/10.21437/interspeech.2024-1105

问 AI

Bookmark

View Full Paper