What type of study is this?

This is a Prospective diagnostic concordance study study.

September 5, 2025

WTP5.04 Large Language Models for Highlighting Trial Patients in MDT

Key Points

Initial results show both LLMs provide structured treatment plans, improving MDT efficiency.
Concordance scores indicate good alignment with actual MDT outcomes, potentially benefiting patient care.
Claude Opus 3 demonstrates superior reasoning capabilities, enhancing clinical decision-making.
Guideline-based management is integral to the assessment of LLMs' capabilities in clinical settings.

Abstract

Abstract Aim Multidisciplinary team meetings (MDT) are a essential part of modern cancer care, involving multiple specialists discussing patient diagnosis and management. Increased workloads and complexity of patients and treatment put pressure on these meetings, worsened by other clinical commitment pressures. Large language models (LLMs) are able to understand large amount of information and respond to specific queries and prompts, and are beginning to be used as adjuncts to clinical care. To assess the difference between to LLMS, Claude Opus 3 and Gemini, in predicting MDT outcomes. Methods A prospective diagnostic concordance and validation study, assessing the LLM’s ability to interpret clinical information and provide guideline-based management recommendations. We will provide the LLMs with the same clinical information that will be presented at MDT and prompt them to create treatment plans based Association of Breast Surgery and National Institute of Health and Care Excellence guidelines. The treatments plans will be reviewed and given a concordance score against the actual MDT outcome. Results Initial results demonstrate that both LLMs produce treatment plans with structure and justification according to the guidelines. We would predict that there is good concordance with the MDT outcome but that Claude is able to provide more reasoning around further decision making. Conclusions We predict that Claude Opus 3, being a more advanced LLM that is trained with more professional data including medical data, would have better concordance with actual MDT outcomes. Claude also understands more complex prompts including around the need for further investigation and imaging.

Bookmark

Cite This Study

Badenoch et al. (Fri,) studied this question.

synapsesocial.com/papers/68bb3a432b87ece8dc9553e7 https://doi.org/https://doi.org/10.1093/bjs/znaf166.366

Bookmark