What question did this study set out to answer?

The research aims to evaluate the effectiveness of various language models in simplifying biomedical texts for better comprehension.

April 17, 2026

Towards Benchmarking Transformer Models for Biomedical Text Simplification

Key Points

The research aims to evaluate the effectiveness of various language models in simplifying biomedical texts for better comprehension.
Utilized the Cochrane-Simplification dataset for experiments.
Compared general-purpose models with domain-specific models.
Applied metrics like ROUGE, BLEU, BERTScore, and SARI for evaluation.
Included models such as BART, PEGASUS, and BioBARTv2.
BART-based models demonstrated superior performance in text simplification tasks.
Domain-specific models showed improvements in semantic coherence.
General-purpose models effectively simplified biomedical texts but to a lesser extent.

Abstract

Biomedical texts typically contain a high level of technical terminology and complex sentence structures, which limits their comprehensibility for readers without domain expertise. Text simplification, a natural language processing problem, aims to transform complex texts into a more readable and accessible form while preserving their original semantic content. Especially in biomedical texts, simplification can play an essential role in making scientific information understandable to patients and the general public. In this context, this study investigates the text simplification performance of pre-trained general-purpose and domain-specific language models (PLMs) for biomedical texts. The experiments utilize the Cochrane-Simplification dataset, which comprises technical abstracts from systematic reviews and their corresponding plain language summaries. General-purpose models and summarization tuned variants (BART-Large, BART-Large-CNN, BART-Large-XSum, PEGASUS-Large, PEGASUS-XSum, T5 and FLAN-T5) are compared alongside domain-specific models (BioBARTv2-Large, SciFive, Clinical-T5) under comparable fine-tuning settings. The models were compared using ROUGE, BLEU, BERTScore and SARI metrics to measure textual similarity and semantic coherence. The results indicate that BART based models achieve superior performance in the medical text simplification task.

Bookmark

View Full Paper

Bookmark

View Full Paper

Towards Benchmarking Transformer Models for Biomedical Text Simplification

Key Points

Abstract

Cite This Study