What question did this study set out to answer?

The aim is to design and evaluate a retrieval-augmented generation (RAG) system for guided maintenance tasks in industrial settings.

February 21, 2026Open Access

Guided Operations and Maintenance with Retrieval-Augmented Generation: System Design and Evaluation at a Machine Manufacturer

Key Points

The aim is to design and evaluate a retrieval-augmented generation (RAG) system for guided maintenance tasks in industrial settings.
Developed a RAG system using documentation from a machine manufacturer
Evaluated based on conformance to context, completeness of answers, and response latency
Utilized both manual and automated evaluation methods for rigorous assessment
Achieved over 90% conformance with provided context
Demonstrated more than 80% answer completeness regarding user queries
Validated technical feasibility and practical relevance of the system

Abstract

The increasing complexity of production systems and the shortage of skilled labor highlight the growing importance of digital services that support machine operators in guided operations and maintenance tasks, such as fault diagnosis and repair. Recent advances in foundation models with sophisticated language and image processing capabilities offer promising new avenues for natural human-machine interaction, improved information retrieval, and effective knowledge management in industrial contexts. However, challenges remain in the integration of domain-specific knowledge into these models, particularly in minimizing hallucinations and ensuring accurate, reliable system behavior. Additionally, general evaluation metrics often fail to capture the nuanced performance of retrieval-augmented generation (RAG) systems in specific industrial domains, calling for rigorous, domain-aware validation approaches. This paper presents the design and evaluation of a RAG system for guided operations and maintenance, developed using documentation from a machine manufacturer. The system is evaluated based on three key criteria: conformance with provided context, completeness of answers in relation to the user query, and response latency. An orchestrated approach combining manual and automated evaluation methods is proposed to assess the individual components of the RAG pipeline, including database design, retrieval quality, contextual prompting, and foundation model selection. Results from the initial prototype demonstrate over 90% conformance and more than 80% answer completeness, validating both the technical feasibility and practical relevance of foundation model-based support systems for this application. The study contributes a novel evaluation approach and provides empirical evidence for the integration of RAG architectures in industrial guided operations and maintenance scenarios.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Wulf et al. (Thu,) studied this question.

synapsesocial.com/papers/69994b64873532290d01f97c https://doi.org/https://doi.org/10.1016/j.procir.2025.09.030

Bookmark

View Full Paper