What question did this study set out to answer?

This research aims to enhance the identification of high-risk cytogenetic abnormalities in multiple myeloma using a large language model.

May 29, 2026

Identifying high-risk cytogenetic subgroups in multiple myeloma using large language artificial intelligence models.

Key Points

This research aims to enhance the identification of high-risk cytogenetic abnormalities in multiple myeloma using a large language model.
Extracted oncology notes and pathology reports from the electronic health records of patients with multiple myeloma registered in the VA Cancer Registry System.
Annotated 1600-character text snippets for abnormalities by two physician reviewers to identify cytogenetic abnormalities and their types.
Utilized few-shot prompting with GPT-OSS 120B for extraction and validated the model on a test set of 985 snippets from 773 patients.
The model achieved a precision of 88.9%, recall of 82.8%, and F1 score of 85.7% for high-risk abnormalities per current guidelines.
From 7250 unlabeled notes, the model identified high-risk cytogenetics in 501 (6.9%) notes and any cytogenetic information in 5385 (74.3%).
It effectively adapted to multiple definitions of high-risk cytogenetic abnormalities, showing strong extraction capabilities.

Abstract

7523 Background: The Revised International Staging System (R-ISS) is the clinical standard for risk stratification in multiple myeloma (MM) and is central to treatment selection and prognostication. However, the cytogenetic abnormalities on which the R-ISS is predicated are typically reported in unstructured formats within bone marrow biopsy reports and clinical notes. Despite the critical importance and relevance of obtaining cytogenetic abnormalities in MM, this information is often missing from large clinical databases, limiting their availability for large scale research and real-world analyses. Here, we created a large language model (LLM)-based algorithm to identify cytogenetic abnormalities from unstructured text and evaluated our model’s adaptability by comparing its performance across definitions of high-risk abnormalities. Methods: We extracted oncology notes and pathology reports within 2 years of diagnosis from the EHR of adult patients with MM registered within the Department of Veterans Affairs (VA) Cancer Registry System between 2000 and 2024. Two physician reviewers iteratively annotated 1600-character snippets from these texts for abnormalities and their type (gain, deletion, translocation) plus normal cytogenetics. From these annotations we identified items with high-risk cytogenetic abnormalities per current guidelines (del(17p) in > 20% plasma cells; TP53 mutation; gain(1q) and/or del(1p32) plus t(4;14), t(14;16), or t(14;20); monoallelic del(1p32) plus gain(1q); biallelic del(1p32)) and previous guidelines (t(4;14), t(14;16), t(14;20), del TP53/17p, del(1p), gain(1q)). We used few-shot prompting with GPT-OSS 120B to extract cytogenetic results and validated our model on a test set of 985 snippets representing 773 patients at 98 VA centers. Results: The test set contained 81 snippets with normal cytogenetics and 464 with abnormalities; 58 were high-risk per current guidelines and 144 per previous guidelines. Our model successfully identified high-risk cytogenetics per current (P: 88.9%, R: 82.8%, F1: 85.7%) and previous guidelines (P: 86.7%, R: 90.3%, F1: 88.4%), as well as type of cytogenetic abnormality (weighted averages; P: 92.3%, R: 92.2%, F1: 92.4%), demonstrating its utility and flexibility for the task as standards change. Applied to a cohort of 7250 unlabeled notes, our model identified high-risk cytogenetics in 501 (6.9%) and any cytogenetic information in 5385 (74.3%). Conclusions: Our model demonstrated the successful capture of cytogenetic information across multiple definitions of high-risk cytogenetic abnormalities, providing a fast, adaptable framework for extraction that traditionally required repeated large-scale human efforts. Leveraging an LLM for this task enables scalable MM risk stratification while reducing clinician burden and supporting rapid adaptation to evolving cytogenetic risk definitions.

Mark Helpful

Bookmark

Relay