What question did this study set out to answer?

This project aims to enhance nephrology fellowship training by utilizing AI to analyze clinical documentation and provide tailored educational feedback.

March 28, 2026 | Open Access

WCN26-8686 Leveraging Artificial Intelligence to Deliver Precision Medical Education in Nephrology Fellowship Training

Key Points

Extracted clinical encounters related to hyponatremia from nephrology fellows at an academic medical center.
Analyzed encounters using expert reviewers and various large language models (LLMs).
Mapped diagnoses to the ABIM nephrology blueprint and evaluated clinical reasoning through a validated tool.
Expert reviewers identified a range of hyponatremia diagnoses; LLM performance varied by model and by diagnosis.
Qwen2.5 demonstrated the highest accuracy compared to other models.
Agreement between expert reviewers and Qwen2.5 was moderate for disease identification (Cohen's kappa 0.56), while clinical reasoning scores showed only weak correlation (Spearman 0.361, Pearson 0.320 for total R-IDEA score).

Abstract

Introduction: A major barrier to actualizing precision medical education is performing the ongoing, continuous analysis necessary for assessment and iterative feedback to improve foundational knowledge and diagnostic reasoning. In this pilot project, we are leveraging large language models (LLMs) to analyze nephrology fellows' clinical documentation and map their diagnostic exposures to topics relevant to the practice of nephrology, with the goal of providing subsequent targeted educational interventions based on each learner's needs.

Methods: Fifty nephrology fellow hyponatremia clinical encounters (47 inpatient and 3 outpatient) at a large academic medical center were extracted into a HIPAA-compliant secure computing environment. These encounters were analyzed by two expert reviewers and by pretrained LLMs including MedGemma, Qwen2.5, and LLaMA3. We determined the underlying hyponatremia diagnoses present and mapped them to the ABIM nephrology blueprint. We evaluated clinical reasoning using a validated tool (R-IDEA). Expert reviewer results were used as the "gold standard" and compared with LLM output to evaluate LLM performance. Cohen's kappa for inter-rater agreement was determined for hyponatremia diagnoses, and Spearman and Pearson correlations were determined for each R-IDEA clinical reasoning category.

Results: After manual review, expert reviewers identified SIADH (11), hypervolemic hyponatremia (17), low solute intake (4), hyponatremia due to thiazide diuretic use (3), hypertonic hyponatremia (2), pseudohyponatremia (1), and hypotonic hyponatremia due to other causes (24). LLM performance varied by model and across hyponatremia diagnoses. Qwen2.5 performed best at this stage. Inter-rater reliability between expert reviewers and Qwen2.5 was moderate (Cohen's κ = 0.56). Correct identification by the LLM occurred most frequently for SIADH and least frequently for hypotonic hyponatremia due to thiazide diuretic use. At this stage, we found weak agreement between LLM and expert reviewer R-IDEA scores: for the total R-IDEA score, the Spearman correlation was 0.361 and the Pearson correlation was 0.320.

Conclusion: This use of LLMs is an initial proof-of-concept project that aims to improve nephrology fellow education by analyzing learners' real-world documentation, with plans for subsequent targeted educational interventions to meet learners' needs and improve clinical reasoning. We have demonstrated modest agreement between expert reviewers and readily available LLMs on the hyponatremia diagnoses present, and weak agreement when evaluating learner clinical reasoning. Efforts to optimize model performance are ongoing. Planned next steps are to pilot delivery of targeted educational interventions based on real-time evaluation of these data and to scale the system throughout the nephrology curriculum, enabling continuous learning throughout fellowship that is tailored to each fellow's needs.

I have no potential conflict of interest to disclose. I did not use generative AI and AI-assisted technologies in the writing process.
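The agreement statistics named in the Methods (Cohen's kappa for diagnosis labels, Pearson and Spearman correlation for R-IDEA scores) can be illustrated with a minimal, standard-library-only sketch. This is not the authors' analysis code. The function names, the label strings, and the toy data below are all hypothetical and are not taken from the study. The sketch also assumes one diagnosis label per encounter, which the abstract does not state.

```python
from collections import Counter
from math import sqrt


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who each assign one label per item."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when both raters use one identical label")
    return (observed - expected) / (1 - expected)


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the tie-averaged ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical toy data, NOT study data: one diagnosis label per encounter.
expert_dx = ["SIADH", "hypervolemic", "SIADH", "thiazide",
             "low solute", "hypervolemic", "SIADH", "thiazide"]
model_dx = ["SIADH", "hypervolemic", "SIADH", "hypervolemic",
            "low solute", "hypervolemic", "hypervolemic", "thiazide"]

# Hypothetical total reasoning scores from an expert and from a model.
expert_scores = [4, 7, 5, 9, 6, 3]
model_scores = [5, 6, 5, 8, 8, 4]

print("kappa:", round(cohen_kappa(expert_dx, model_dx), 3))
print("spearman:", round(spearman(expert_scores, model_scores), 3))
print("pearson:", round(pearson(expert_scores, model_scores), 3))
```

Kappa corrects raw percent agreement for the agreement expected by chance, which matters when a few diagnoses dominate the sample. Spearman works on ranks and Pearson on raw values, so reporting both shows whether a weak association depends on the score scale.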

Thorne et al. studied this question.

synapsesocial.com/papers/69c76fff8bbfbc51511e0672 https://doi.org/10.1016/j.ekir.2026.106097
