What type of study is this?

This is a prospective diagnostic study.

September 5, 2025

MTP3.04 Comparing Large Language Models in Predicting Multi-Disciplinary Team Meeting Outcomes

Key Points

Initial findings show both LLMs generate structured treatment plans according to clinical guidelines, indicating potential utility.
The authors predict good concordance with MDT outcomes for both models, with Claude Opus 3 providing more reasoning to justify its decisions than Gemini.
The study gives the LLMs the same clinical information that is presented at the MDT and scores their treatment plans against the actual MDT outcomes.
The authors also predict that Claude Opus 3 will handle more complex clinical prompts, including recognising the need for further investigation and imaging.

Abstract

Aim: Multidisciplinary team (MDT) meetings are an essential part of modern cancer care, involving multiple specialists discussing patient diagnosis and management. Increasing workloads and the growing complexity of patients and treatments put pressure on these meetings, and competing clinical commitments add to it. Large language models (LLMs) can process large volumes of information and respond to specific queries and prompts, and are beginning to be used as adjuncts to clinical care. We aim to assess the difference between two LLMs, Claude Opus 3 and Gemini, in predicting MDT outcomes.

Methods: This is a prospective diagnostic concordance and validation study assessing the LLMs' ability to interpret clinical information and provide guideline-based management recommendations. We will provide the LLMs with the same clinical information that will be presented at the MDT and prompt them to create treatment plans based on Association of Breast Surgery and National Institute for Health and Care Excellence guidelines. The treatment plans will be reviewed and given a concordance score against the actual MDT outcome.

Results: Initial results demonstrate that both LLMs produce structured treatment plans according to the guidelines. We predict good concordance with the MDT outcome, and that Claude will provide more reasoning to justify its decisions.

Conclusions: We predict that Claude Opus 3, a more advanced LLM trained with professional data including medical data, will have better concordance with actual MDT outcomes. We predict it will also be able to understand more complex prompts, including the need for further investigation and imaging.
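The abstract does not say how the concordance score is defined or aggregated. As a minimal sketch of one possible approach, the Python below assumes each reviewed treatment plan is given one of three labels and reports the proportion of cases at each level for each model. The three-level scale, the function name `concordance_summary`, the model keys and the input lists are all hypothetical placeholders for illustration. They are not the study's scoring scheme or data.

```python
from collections import Counter

# Hypothetical three-level scale; the abstract does not define its scoring scheme.
SCORES = ("concordant", "partially concordant", "discordant")


def concordance_summary(case_scores):
    """Return the proportion of cases at each concordance level."""
    if not case_scores:
        raise ValueError("no cases to summarise")
    unknown = set(case_scores) - set(SCORES)
    if unknown:
        raise ValueError(f"unrecognised scores: {sorted(unknown)}")
    counts = Counter(case_scores)
    total = len(case_scores)
    # Counter returns 0 for levels that never occur, so every level is reported.
    return {level: counts[level] / total for level in SCORES}


# Placeholder inputs for illustration only; these are not study data.
example_scores = {
    "model_a": ["concordant", "concordant", "partially concordant", "discordant"],
    "model_b": ["concordant", "partially concordant", "partially concordant", "discordant"],
}

for model, scores in example_scores.items():
    print(model, concordance_summary(scores))
```

Reporting a proportion per level, rather than a single pooled number, keeps partial agreement visible, for example a plan that matches the MDT's treatment decision but omits a request for further imaging.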
