What question did this study set out to answer?

To develop and evaluate a pipeline using large language models for extracting adverse drug events from electronic health records in inflammatory bowel disease.

January 23, 2026

P0826Large Language Models-Driven Real-World Pharmacovigilance in IBD: Open-Vocabulary, Adverse Drug Events Extraction from Multicenter Electronic Health Records

Key Points

To develop and evaluate a pipeline using large language models for extracting adverse drug events from electronic health records in inflammatory bowel disease.
Developed a novel end-to-end LLM pipeline for ADE detection.
Annotated 8406 electronic health record notes from multiple medical centers.
Utilized a high-recall pre-screening module and graph-based retrieval strategies.
Implemented ontology-aware normalization for MedDRA alignment.
Achieved an AUC of 0.809 in the Crohn's disease test set and 0.828 in the ulcerative colitis test set.
Identified drug-AE pairs with F1-scores of 0.577 for CD and 0.545 for UC cohorts.
Discovered the top five ADEs: bone marrow suppression, liver function abnormality, C.difficile infection, rash, and paresthesia.

Abstract

Abstract Background Adverse drug events (ADEs) are a major source of preventable harm. Inflammatory bowel disease (IBD) requires long-term multidrug management, making ADEs frequent and clinically significant. Extracting ADEs from electronic health records (EHRs) is central to pharmacovigilance but challenging due to overlapping disease activity and drug toxicity, complex polypharmacy, and heterogeneous Chinese clinical narratives. Conventional named entity recognition (NER)–relation extraction (RE) pipelines and fixed vocabularies often miss evolving expressions. Large language models (LLMs) show promise for ADE detection1, yet most current approaches are not end-to-end and rely on constrained annotation schemas, limiting scalability and real-world generalizability in IBD. Methods We developed an end-to-end LLM pipeline for open-vocabulary detection of ADEs following treatment with corticosteroids, immunomodulators, biologics, and small-molecule inhibitors in IBD. A total of 8406 IBD notes (Peking Union Medical College Hospital = 7936; Zunyi Medical University = 216; Guizhou Provincial People’s Hospital = 254) were annotated. The system directly reads clinical text, expands candidate events through knowledge-augmented retrieval, and normalizes outputs to MedDRA to ensure reliable and scalable pharmacovigilance. It integrates (i) a high-recall pre-screening module to retain plausible ADE signals while minimizing unnecessary LLM calls, (ii) graph-based retrieval over a drug–event bipartite network to broaden candidate scope, (iii) ensemble LLM inference guided by a self-learned instruction set, and (iv) ontology-aware normalization aligning terms with MedDRA and ensuring cross-center consistency. Results In the binary classification task of detecting the presence of any AE within a patient’s record, we ultimately select HYBRID + LR model for subsequent analyses which achieved Area Under the Curve (AUC) of 0.809 in CD test set(Figure1A), and 0.828 in UC test set(Figure1B). Our model then achieved the good performance on identifying drug-AE pairs: CD test set an F1-score of 0.577, a recall of 0.706, and a precision of 0.488 for the CD cohort; For the UC cohort, the model achieved an overall F1-score of 0.545, recall of 0.624, and precision of 0.484. We identified the top five ADEs in the IBD cohort: bone marrow suppression (n = 137), liver function abnormality (n = 134), C.difficile infection (n = 73), rash (n = 71), and paresthesia (n = 70). Conclusion The pipeline enables near real-time ADE detection and supports risk prediction in IBD. Embedding LLM-based pharmacovigilance in EHRs may deliver continuous safety surveillance and bridge clinical practice with regulatory science for data-driven, real-time monitoring. Reference: 1. Syrowatka A, Song W, Amato MG, et al. Key use cases for artificial intelligence to reduce the frequency of adverse drug events: a scoping review. Lancet Digit Health. Feb 2022;4(2):e137-e148. doi:10.1016/s2589-7500(21)00229-6 Conflict of interest: Ms. Wei, Yuge: None Ronghao, Li: None Gechong, Ruan: None Bai, Xiaoyin: None Yinghao, Sun: None Dejun, Cui: None Fang, Yan: None Huijun, Shu: None Xuemin, Yan: None Honglei, Liu: None Yang, Hong: None

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Y Wei

L Ronghao

R Gechong

Journals

Journal of Crohn s and Colitis

Actions

Institutions

Chinese Academy of Medical Sciences & Peking Union Medical College

Capital Medical University

Peking Union Medical College Hospital

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

P0826Large Language Models-Driven Real-World Pharmacovigilance in IBD: Open-Vocabulary, Adverse Drug Events Extraction from Multicenter Electronic Health Records

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study