11132 Background: Immune-related adverse events (irAEs) from immune checkpoint inhibitors (ICIs) can be lifelong, fatal, or require treatment discontinuation, substantially limiting the clinical benefit of immunotherapy. Despite their impact, predictors of irAE susceptibility remain poorly defined, largely due to the lack of scalable approaches for toxicity phenotyping. Here, we leverage large language models and integrated clinico-genomic data to identify determinants of irAEs. Methods: We developed a custom retrieval augmented generation and large language model (RAG-LLM) pipeline to automatically annotate 6 key adverse events (adrenal insufficiency, hepatotoxicity, hyperthyroidism, hypothyroidism, colitis, and pneumonitis) using free text from clinical notes across Memorial Sloan Kettering Cancer Center. We first validated RAG-LLM predictions using a gold standard prospectively collected adverse event dataset from 8,119 patients across 1,057 individual clinical trials. RAG-LLM imputations were then scaled to 55,406 (12,291 ICI-treated) patients. All patients had associated somatic and germline MSK-IMPACT panel sequencing data. Single nucleotide polymorphism (SNP) imputation was performed using GLIMPSE and time-to-event genome-wide association studies (GWAS) were performed using SPACox. HLA class I genotypes were imputed using HLA-HD. Random Survival Forest (RSF) models were trained to predict irAE occurrence. Results: The custom RAG-LLM pipeline had strong performance across all 6 irAEs (area under the ROC curve of 0.77-1.00). In the RAG-LLM imputed data, 799 (1.4%) patients had adrenal insufficiency, 2,803 colitis (5.1%), 7,130 hypothyroidism (12.9%), 448 hyperthyroidism (0.8%), 15,627 hepatotoxicity (28.2%), and 3,730 pneumonitis (6.7%). Among ICI-treated patients, pneumonitis (HR 1.4; 95% CI 1.1-1.8; p < 0.01) and hepatotoxicity (HR 1.4; 95% CI 1.2-1.6; p < 0.01) were associated with worse overall survival. The GWAS found two genome-wide significant hits that predicted adrenal insufficiency (rs115003145, HLA region) and hypothyroidism (rs7864322, FOXE1 gene enhancer region) with p < 5x10 -8 . Fine mapping of the HLA SNP showed that HLA-C*06:02 was specifically associated with increased risk of adrenal insufficiency in ICI-treated patients only (HR 1.6; 95% CI 1.2-2.2; p < 0.01). RSF model performance for irAE occurrence had F1 scores of 0.67-0.85. For pneumonitis, patients with the highest risk quartile by RSF model had 7.1% risk of pneumonitis at 1 year compared to 0.4% for the lowest risk quartile (HR 14.9; 95% CI 10.4-21.3; p < 0.01). Conclusions: We developed and validated a novel custom RAG-LLM pipeline that allows automatic annotation of irAEs using clinical notes. Using this pipeline, we identified novel biomarkers of ICI-related adrenal insufficiency and hypothyroidism. Finally, we developed predictive models for irAE prediction that can be used at the point of care.
Building similarity graph...
Analyzing shared references across papers
Loading...
Ziad Bakouny
Memorial Sloan Kettering Cancer Center
X. Alex Guo
Memorial Sloan Kettering Cancer Center
Rohan Walser
Memorial Sloan Kettering Cancer Center
Journal of Clinical Oncology
Memorial Sloan Kettering Cancer Center
Building similarity graph...
Analyzing shared references across papers
Loading...
Bakouny et al. (Wed,) studied this question.
synapsesocial.com/papers/6a192f07fab5b468c44185f4 — DOI: https://doi.org/10.1200/jco.2026.44.16_suppl.11132