Abstract: Named Entity Recognition (NER) is highly relevant in the clinical field because it allows information such as diagnoses or medical procedures to be extracted from unstructured data (doctors’ letters, vignettes, etc.) and coded according to international classification systems. Language models therefore need to be trained to recognize and classify these items accurately. Although Large Language Models (LLMs) such as ChatGPT can recognize medical entities in text, they do not perform this task reliably. Unlike English, for which a variety of resources support this task, other languages, such as German, lack appropriate language models. This study presents a methodology for generating high-quality, fully synthetic datasets and implements a workflow for identifying and classifying diseases, co-diseases, and medical procedures in oncology clinical narratives.
Mustafa et al. (Wed,) studied this question.