Large Language Models (LLMs) exhibit strong generative capabilities but remain limited in structured knowledge domains due to factual inconsistency, shallow multi-hop reasoning, and weak alignment with domain constraints. To address these limitations, this paper presents EduRAG-Compose, a unified hybrid retrieval–generation architecture that integrates hierarchical coarse-to-fine retrieval, dynamic knowledge graph traversal, cache-augmented inference, and compositional multi-step generation into a single framework for structured domain reasoning. The architecture combines clustered dense retrieval for efficient evidence selection, graph-based dependency traversal for multi-hop inference, recursive retrieval–generation cycles for compositional reasoning, and a caching layer to reduce end-to-end latency. A domain-alignment classifier and a constrained decoding strategy further support structured and interpretable output generation consistent with underlying semantic dependencies. The framework is evaluated on structured-domain benchmarks including Science Question Answering (ScienceQA), Textbook Question Answering (TQA), and interaction-derived query sets from EdNet, focusing on retrieval quality, reasoning coherence, and generative consistency. Across these datasets, EduRAG-Compose achieves 91.2 ± 1.3% factual accuracy, outperforming vanilla Retrieval-Augmented Generation (RAG) at 78.4 ± 2.1%, ColBERT-based Retrieval-Augmented Generation (ColBERT-RAG) at 82.9 ± 1.9%, and Generative Pre-trained Transformer with retrieval (GPT + RAG) at 88.5 ± 1.5%. Domain-alignment performance improves from 0.68 ± 0.05 to 0.89 ± 0.03, while structural reasoning consistency increases from 0.61 ± 0.06 to 0.87 ± 0.04, with an average response latency of 4.8 ± 0.7 s, comparable to baseline retrieval systems and substantially lower than Chain-of-Thought (CoT) prompting. All reported improvements are statistically significant based on paired Wilcoxon signed-rank tests (p < 0.01). Qualitative reasoning traces further demonstrate that explicit retrieval paths and graph traversal sequences enhance interpretability and support human-in-the-loop verification. Rather than introducing new retrieval or generation algorithms, the primary contribution lies in the systematic integration of complementary retrieval and reasoning mechanisms into a single transparent architecture for structured domains. Limitations related to dataset scope, computational overhead, and text-only processing are acknowledged, and future work will explore multimodal extensions, computational optimization for real-time deployment, and adaptive reasoning mechanisms capable of incorporating continuous feedback or evolving knowledge sources. Unified Hybrid RAG Framework: Proposes EduRAG-Compose integrating hierarchical retrieval, graph reasoning, caching, and compositional generation. Statistically Proven Gains: Achieves significant improvements in factual accuracy and domain alignment over strong baselines (p < 0.01). Explainable Multi-Hop Reasoning: Enables transparent inference via explicit retrieval paths and graph traversal traces. Scalable Structured-Domain Design: Supports efficient and portable deployment across diverse structured knowledge domains.
Building similarity graph...
Analyzing shared references across papers
Loading...
Rathinasamy Muthusami
Pennsylvania College of Health Sciences
Kandhasamy Saritha
Discover Artificial Intelligence
Building similarity graph...
Analyzing shared references across papers
Loading...
Muthusami et al. (Tue,) studied this question.
synapsesocial.com/papers/69b3abd602a1e69014ccd0f7 — DOI: https://doi.org/10.1007/s44163-026-01059-9