What type of study is this?

October 8, 2025Open Access

R-LoRA: Randomized Multi-Head LoRA for Efficient Multi-Task Learning

Key Points

R-LoRA enhances performance in multi-task learning by increasing diversity in head matrices.
Experiments demonstrate improvements in training time and GPU memory usage with R-LoRA.
The method incorporates multi-head randomization strategies, including dropout and initialization.
R-LoRA provides a cost-effective fine-tuning solution for language models across various domains.

Abstract

Fine-tuning large language models (LLMs) is computationally expensive, and Low-Rank Adaptation (LoRA) provides a cost-effective solution by approximating weight updates through low-rank matrices. In real-world scenarios, LLMs are fine-tuned on data from multiple domains to perform tasks across various fields, embodying multi-task learning (MTL). LoRA often underperforms in such complex scenarios. To enhance LoRA's capability in multi-task learning, we propose R-LoRA, which incorporates Multi-Head Randomization. Multi-Head Randomization diversifies the head matrices through Multi-Head Dropout and Multi-Head Random Initialization, enabling more efficient learning of task-specific features while maintaining shared knowledge representation. Our approach not only improves performance in MTL but also reduces GPU memory usage and training time. Experiments show that R-LoRA's gains stem from increased diversity in the head matrices, demonstrating its effectiveness for multi-task learning. The code is available at https://github.com/jinda-liu/R-LoRA

Read Full Paperexternally

Ask AI

Mark Helpful

Bookmark

Relay

View Full Paper