September 10, 2025 | Open Access

LoRA-Tuned Multimodal RAG System for Technical Manual QA: A Case Study on Hyundai Staria

Key Points

The proposed system improved BERTScore by 3.0 percentage points and ROUGE-L by 18.0 percentage points over existing RAG systems on technical question answering.
Text and images extracted from Hyundai Staria maintenance manuals (PDF) were used to build the QA, RAG, and Multi-Turn datasets.
The enhanced RAG architecture utilized LoRA and achieved notable improvements in image-guided response accuracy.
A qualitative evaluation by 20 domain experts yielded an average satisfaction score of 4.4 out of 5.

Abstract

This study develops a domain-adaptive multimodal RAG (Retrieval-Augmented Generation) system to improve the accuracy and efficiency of technical question answering based on large-scale structured manuals. Using Hyundai Staria maintenance documents as a case study, we extract text and images from PDF manuals and construct QA, RAG, and Multi-Turn datasets that reflect realistic troubleshooting scenarios. To overcome the limitations of baseline RAG models, we propose an enhanced architecture that incorporates sentence-level similarity annotations and parameter-efficient fine-tuning via LoRA (Low-Rank Adaptation), using the bLLossom-8B language model and the BAAI-bge-m3 embedding model. Experimental results show that the proposed system improves on existing RAG systems by 3.0 percentage points in BERTScore, 3.0 percentage points in cosine similarity, and 18.0 percentage points in ROUGE-L, with notable gains in image-guided response accuracy. A qualitative evaluation by 20 domain experts yielded an average satisfaction score of 4.4 out of 5. This study presents a practical and extensible AI framework for multimodal document understanding, with broad applicability across automotive, industrial, and defense-related technical documentation.
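For readers unfamiliar with two of the metrics named above, the following is a minimal sketch of how cosine similarity and ROUGE-L can be computed. It is an illustration only, not the authors' evaluation code: the function names are hypothetical, tokenization is plain whitespace splitting, and ROUGE-L is taken as the balanced F-measure over the longest common subsequence (LCS), all of which are assumptions that the abstract does not specify.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists (row-by-row DP)."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(reference, candidate):
    """ROUGE-L as the balanced F-measure of LCS precision and recall.

    Assumption: whitespace tokenization, no stemming or casing normalization.
    """
    ref_tokens = reference.split()
    cand_tokens = candidate.split()
    lcs = lcs_length(ref_tokens, cand_tokens)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand_tokens)
    recall = lcs / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical reference answer and model answer, for illustration only.
    reference = "check the brake fluid level"
    candidate = "check brake fluid"
    print(rouge_l_f1(reference, candidate))
    print(cosine_similarity([1.0, 2.0, 0.0], [2.0, 4.0, 0.0]))
```

Because both scores lie in the range 0 to 1 for typical inputs, a gain reported in percentage points (for example, 18.0 for ROUGE-L) is an absolute difference between two scores expressed as percentages, not a relative change.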

Authors

Yerin Nam

Hyeung‐Sik Choi

Jonggeun Choi

Journals

Applied Sciences

Institutions

Seoul National University of Science and Technology
