Key points are not available for this paper at this time.
Multi-task learning (MTL) approach leverages pre-trained models in speech and machine translation and has significantly advanced speech-to-text translation tasks. However, it introduces a considerable number of parameters, leading to increasing training costs. Most parameter-efficient fine-tuning (PEFT) methods only train additional modules to effectively reduce the number of trainable parameters. Nevertheless, the increase in trainable parameters caused by the PEFT method remains non-negligible in multilingual speech translation settings. In this paper, we first propose the parameter-sharing adapter, which reduces parameters by 7/8 compared to regular adapters, with only approximately 0.7% performance decrease. For the balance between model parameter quantity and performance, we present a neural network search (NAS) based model. Experimental results revealed that the performance of adapter is closest to fine-tuning, while LoRA exhibits the poorest performance.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chen et al. (Sun,) studied this question.
www.synapsesocial.com/papers/68e59e92b6db643587538a7e — DOI: https://doi.org/10.21437/interspeech.2024-759
Nan Chen
Yonghe Wang
Feilong Bao
Building similarity graph...
Analyzing shared references across papers
Loading...