Existing text-to-image models often rely on parameter fine-tuning techniques such as Low-Rank Adaptation (LoRA) to customize visual attributes. However, when combining multiple LoRA models for content-style fusion tasks, unstructured modifications of weight matrices often lead to undesired feature entanglement between content and style attributes. We propose QR-LoRA, a novel fine-tuning framework leveraging QR decomposition for structured parameter updates that effectively separate visual attributes. Our key insight is that the orthogonal Q matrix naturally minimizes interference between different visual features, while the upper triangular R matrix efficiently encodes attribute-specific transformations. Our approach fixes both Q and R matrices while only training an additional task-specific ΔR matrix. This structured design reduces trainable parameters to half of conventional LoRA methods and supports effective merging of multiple adaptations without cross-contamination due to the strong disentanglement properties between ΔR matrices. Experiments demonstrate that QR-LoRA achieves superior disentanglement in content-style fusion tasks, establishing a new paradigm for parameter-efficient, disentangled fine-tuning in generative models. The project page is available at: https: //luna-ai-lab. github. io/QR-LoRA/.
Building similarity graph...
Analyzing shared references across papers
Loading...
Jiahui Yang
Yugao Ma
Donglin Di
Building similarity graph...
Analyzing shared references across papers
Loading...
Yang et al. (Mon,) studied this question.
www.synapsesocial.com/papers/68f163c79903599108abcda8 — DOI: https://doi.org/10.48550/arxiv.2507.04599