What question did this study set out to answer?

This study aims to enhance retrieval-augmented generation methods for clinical diagnosis in traditional Chinese medicine.

April 25, 2026Open Access

TCM-DiffRAG: personalized syndrome differentiation reasoning method for traditional Chinese medicine based on knowledge graph and chain of thought

Puntos clave

This study aims to enhance retrieval-augmented generation methods for clinical diagnosis in traditional Chinese medicine.
Developed TCM-DiffRAG integrating knowledge graphs with chains of thought.
Evaluated performance on three distinctive TCM test datasets.
Compared effectiveness against native LLMs and directly supervised fine-tuned LLMs.
TCM-DiffRAG improved model scores from 0.927 to 0.952, 0.361 to 0.788, and 0.038 to 0.356 (McNemar test, p < 0.01).
The enhancements were more pronounced for non-Chinese LLMs.
Outperformed benchmark RAG methods and traditional fine-tuned LLMs.

Resumen

Background Retrieval-augmented generation (RAG) technology can empower large language models (LLMs) to generate more accurate, professional, and timely responses without fine-tuning. However, due to the complex reasoning processes and substantial individual differences involved in traditional Chinese medicine (TCM) clinical diagnosis and treatment, traditional RAG methods often exhibit poor performance in this domain. Objective To address the limitations of conventional RAG approaches in TCM applications, this study aims to develop an improved RAG framework tailored to the characteristics of TCM reasoning. Methods We developed TCM-DiffRAG, an innovative RAG framework that integrates knowledge graphs (KG) with chains of thought (CoT). TCM-DiffRAG was evaluated on three distinctive TCM test datasets. Results The experimental results demonstrated that TCM-DiffRAG achieved significant performance improvements over native LLMs. For example, the qwen-plus model achieved scores of 0.927, 0.361, and 0.038, which were significantly enhanced to 0.952, 0.788, and 0.356 with TCM-DiffRAG (McNemar test, p 0.01). The improvements were even more pronounced for non-Chinese LLMs. Additionally, TCM-DiffRAG outperformed directly supervised fine-tuned (SFT) LLMs and other benchmark RAG methods. Conclusion TCM-DiffRAG shows that integrating structured TCM knowledge graphs with Chain-of-Thought–based reasoning substantially improves performance in individualized diagnostic tasks. The joint use of universal and personalized knowledge graphs enables effective alignment between general knowledge and clinical reasoning. These results highlight the potential of reasoning-aware RAG frameworks for advancing LLM applications in traditional Chinese medicine.

Me gusta

Guardar

Ver artículo completo