March 12, 2026 | Open Access

Biomedical Named Entity Recognition for Clinical Notes: A Comparative Study

Key Points

Compared several biomedical named entity recognition (NER) approaches on clinical notes
Evaluated a SciSpacy pipeline and transformer models (BioBERT/PubMedBERT, ClinicalBERT)
Ran experiments on approximately 5,000 clinical notes drawn from publicly available medical transcription samples
Focused on implementation aspects: long-note handling, tokenization behavior, engineering complexity, and runtime performance
The SciSpacy pipeline processed notes faster and was more stable on large clinical note collections
Transformer models provided more flexible contextual representations, at higher computational cost
The results show a trade-off between lightweight NER pipelines and more complex transformer-based models

Abstract

Biomedical Named Entity Recognition (NER) is an important task in clinical natural language processing, enabling the extraction of structured medical concepts from unstructured clinical text. This study presents a practical comparison of multiple biomedical NER approaches applied to clinical notes. Three different methods were evaluated: a SciSpacy-based pipeline and transformer-based models built on BioBERT/PubMedBERT and ClinicalBERT architectures. Experiments were conducted on approximately 5,000 clinical notes collected from publicly available medical transcription samples. The comparison focuses on practical implementation aspects including handling of long clinical notes, tokenization behavior, engineering complexity, and runtime performance. Results show that the SciSpacy pipeline provides significantly faster processing and greater stability when handling large collections of clinical notes, while transformer-based models offer more flexible contextual representations at the cost of increased computational overhead. These findings highlight the trade-offs between lightweight biomedical NLP pipelines and transformer-based models for large-scale clinical text processing.
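
The "handling of long clinical notes" concern can be illustrated with a sliding window, a common way to fit long text into a BERT-style model's input limit (512 subword tokens). The sketch below is illustrative only and is not taken from the paper: the function name `chunk_with_overlap`, the use of pre-split tokens, and the default window and overlap sizes are all assumptions.

```python
from typing import List, Tuple


def chunk_with_overlap(
    tokens: List[str], max_len: int = 512, overlap: int = 64
) -> List[Tuple[int, List[str]]]:
    """Split tokens into windows of at most max_len tokens.

    Consecutive windows share `overlap` tokens so that an entity cut at
    one window boundary appears whole in the next window. Each result is
    (start_offset, window_tokens); the offset lets predictions be mapped
    back to positions in the original note.
    """
    if max_len <= 0 or not 0 <= overlap < max_len:
        raise ValueError("need max_len > 0 and 0 <= overlap < max_len")

    step = max_len - overlap  # how far each new window advances
    chunks: List[Tuple[int, List[str]]] = []
    start = 0
    while True:
        window = tokens[start:start + max_len]
        if window:
            chunks.append((start, window))
        # Stop once the current window reaches the end of the note.
        if start + max_len >= len(tokens):
            break
        start += step
    return chunks


if __name__ == "__main__":
    # Hypothetical note text; whitespace splitting stands in for a real tokenizer.
    note = "patient reports chest pain radiating to left arm since yesterday"
    for offset, window in chunk_with_overlap(note.split(), max_len=4, overlap=1):
        print(offset, window)
```

In a real transformer pipeline the window would be measured in subword tokens produced by the model's own tokenizer, and entities predicted in overlapping regions would need to be de-duplicated. A rule-based or statistical pipeline that processes documents of arbitrary length does not need this step, which is one source of the engineering-complexity difference the abstract describes.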
