Key points are not available for this paper at this time.
Parameter-efficient fine-tuning (PEFT) methods, which train only a part of a model, yield efficient and effective models. Bottleneck approaches, such as adapters and low-rank adaptation (LoRA), have been found to be beneficial in numerous studies and are widely utilized. In this work, we propose and investigate an enhanced PEFT method that adds convolution to linear projection-based bottleneck approaches. We experiment with HuBERT, a representative speech model pre-trained with self-supervised learning, and fine-tune it for the automatic speech recognition (ASR) task to examine how the proposed PEFT method impacts training and inference. We demonstrate consistent performance improvements with a minimal increase in parameters and computational complexity.
Building similarity graph...
Analyzing shared references across papers
Loading...
Kwangyoun Kim
NetApp (United States)
Suwon Shon
NetApp (United States)
Yi‐Te Hsu
National Institutes of Health
Building similarity graph...
Analyzing shared references across papers
Loading...
Kim et al. (Sun,) studied this question.
synapsesocial.com/papers/68e59e8eb6db6435875389d6 — DOI: https://doi.org/10.21437/interspeech.2024-2188