What does this research mean for the field?

A data and knowledge cross-level fusion-driven learning framework outperforms expert systems and LLM baselines in detecting missed diagnoses in electronic medical records, identifying omissions in 37.8% of records and affecting DRG groupings in 9.0%. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The aim is to develop a framework for automatically identifying missed diagnoses in electronic medical records.

May 16, 2026Open Access

A data and knowledge cross-level fusion-driven learning framework for detecting missing diagnosis

Key Points

The aim is to develop a framework for automatically identifying missed diagnoses in electronic medical records.
Implemented a cross-level fusion-driven learning framework for diagnosis detection.
Evaluated using real-world electronic medical records from six hospitals across China.
Adopted a hybrid approach combining the model with expert systems to improve precision.
37.8% of electronic medical records predicted to have missed diagnoses.
9.0% of cases experienced altered DRG groupings, impacting 3.2% of insurance reimbursements.
Increased precision by 6.7–13.4% through the hybrid approach with expert systems.

Abstract

Abstract Diagnosis omission in discharge diagnosis lists is common in electronic medical records (EMRs), leading to inaccurate documentation, incorrect Diagnosis Related Group (DRG) assignments, and reduced reimbursements from overlooked Complications and Comorbidities (CC) or Major Complications and Comorbidities (MCC). To address this, we propose a data and knowledge cross-level fusion-driven learning framework for automated identification of missed diagnoses. Evaluated on real-world EMRs from six hospitals across various provinces in China, our model outperforms expert system method, BERT-based method, and multiple LLM-based baseline methods, demonstrating superior F1 scores. Results show 37.8% of EMRs predicted to have missed diagnoses, with 9.0% experiencing altered DRG groupings, subsequently affecting 3.2% of insurance reimbursement. To minimize alert fatigue, we adopted a hybrid approach combining our model with expert system, boosting precision by 6.7–13.4%. We also designed two human-machine coupling modes to demonstrate the utility of our methods in the real world.

Bookmark

View Full Paper

Bookmark

View Full Paper

A data and knowledge cross-level fusion-driven learning framework for detecting missing diagnosis

Key Points

Abstract

Cite This Study