April 1, 2026 | Open Access

ASUKAI89 at NTCIR 18 RadNLP Task: Lung Cancer Staging Automatic Classification System Utilizing Large Language Models and Meta-Prompting

Key Points

Aimed to create and test a system that extracts lung cancer TNM classifications from radiology reports.
Developed a classification system built on large language models.
Incorporated explicit TNM criteria and unit specifications.
Performed error analysis and prompt improvements via meta-prompting.
Ran the initial experiments with `gemini-2.0-flash-thinking-exp-1219` and the final evaluation with `o1 2024-12-01-preview`.
Improved overall accuracy by approximately 15% after prompt modification.
Reached about 70% joint accuracy (fine), 76% T accuracy, 93% N accuracy, and 95% M accuracy in the final evaluation.
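
The prompt-related points above (explicit criteria, unit specifications, and meta-prompting driven by error analysis) can be pictured with a minimal sketch. It only assembles prompt text and calls no model. Every name in it (`ErrorCase`, `build_classification_prompt`, `build_meta_prompt`) and all of the prompt wording are hypothetical illustrations, not the prompts used in the paper.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class ErrorCase:
    """One misclassified report (hypothetical structure for illustration)."""
    report: str
    predicted: str
    expected: str


def build_classification_prompt(criteria: str, unit_note: str, report: str) -> str:
    """Assemble a staging prompt that states criteria and units explicitly."""
    return (
        "Classify the lung cancer TNM stage from the radiology report.\n\n"
        f"Criteria:\n{criteria}\n\n"
        f"Units: {unit_note}\n\n"
        f"Report:\n{report}\n\n"
        "Answer with T, N, and M labels only."
    )


def build_meta_prompt(current_prompt: str, errors: list[ErrorCase]) -> str:
    """Ask a model to diagnose the listed errors and propose a revised prompt."""
    lines = [
        f"{i}. predicted={e.predicted} expected={e.expected} report={e.report!r}"
        for i, e in enumerate(errors, start=1)
    ]
    return (
        "The prompt below produced the listed errors. "
        "Explain the likely cause and propose a revised prompt.\n\n"
        f"Current prompt:\n{current_prompt}\n\n"
        "Errors:\n" + "\n".join(lines)
    )


# Placeholder inputs; the real criteria text and reports are not reproduced here.
prompt = build_classification_prompt(
    criteria="<TNM criteria text>",
    unit_note="state every size in cm",
    report="<radiology report text>",
)
meta_prompt = build_meta_prompt(prompt, [ErrorCase("<report text>", "T1a", "T1b")])
```

In a loop of this kind, the text returned for `meta_prompt` would be sent to a model, and its proposed revision would replace the current prompt before the next evaluation round.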

Abstract

This study aims to develop and evaluate a system that automatically extracts the TNM classification of lung cancer (T: primary tumor, N: lymph node metastasis, M: distant metastasis) from radiological diagnosis reports. In the initial experiments, inference was performed using `gemini-2.0-flash-thinking-exp-1219`. We incorporated explicit TNM classification criteria and unit specifications, features absent in conventional methods, and introduced error analysis and prompt improvement through meta-prompting; after these prompt modifications, overall accuracy improved by approximately 15%. In the final evaluation, using the `o1 2024-12-01-preview` model, we achieved approximately 70% joint accuracy (fine), 76% T accuracy, 93% N accuracy, and 95% M accuracy. This paper provides a detailed account of the experimental procedures and the improvement process at each stage.
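
To make the reported metrics concrete, the following sketch computes per-component and joint accuracy. It assumes that joint accuracy counts a report as correct only when the T, N, and M labels all match exactly; the task's own scoring rules, including what "(fine)" denotes, are not modeled. The `TNM` type, the `accuracies` function, and the four sample labels are hypothetical toy data, not results from the paper.

```python
from __future__ import annotations

from typing import NamedTuple


class TNM(NamedTuple):
    """One report's labels (hypothetical container for illustration)."""
    t: str
    n: str
    m: str


def accuracies(gold: list[TNM], pred: list[TNM]) -> dict[str, float]:
    """Return T, N, M, and joint accuracy over paired gold/predicted labels."""
    if not gold or len(gold) != len(pred):
        raise ValueError("gold and pred must be non-empty and of equal length")
    total = len(gold)
    pairs = list(zip(gold, pred))
    return {
        "T": sum(g.t == p.t for g, p in pairs) / total,
        "N": sum(g.n == p.n for g, p in pairs) / total,
        "M": sum(g.m == p.m for g, p in pairs) / total,
        # Joint: all three components must match for the same report.
        "joint": sum(g == p for g, p in pairs) / total,
    }


# Toy data only.
gold = [TNM("T1b", "N0", "M0"), TNM("T2a", "N2", "M1b"),
        TNM("T4", "N3", "M1c"), TNM("T3", "N1", "M0")]
pred = [TNM("T1b", "N0", "M0"), TNM("T2b", "N2", "M1b"),
        TNM("T4", "N3", "M1c"), TNM("T3", "N0", "M0")]

print(accuracies(gold, pred))
# {'T': 0.75, 'N': 0.75, 'M': 1.0, 'joint': 0.5}
```

Under this definition, joint accuracy can never exceed the lowest per-component accuracy, which is consistent with the reported joint figure (about 70%) sitting below the T accuracy (76%).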
