What question did this study set out to answer?

To assess the effectiveness of an ensemble-based approach for classifying noisy clinical dialogues.

March 13, 2026Open Access

Ensemble-Based Multi-Class and Multi-Label Text Classification for Noisy Clinical Dialogues

Key Points

To assess the effectiveness of an ensemble-based approach for classifying noisy clinical dialogues.
Developed ensemble using three fine-tuned Polish T5 models.
Trained on partially overlapping clinical dialogue datasets.
Evaluated on low-quality, automatically transcribed conversations.
Achieved a 21.8% increase in F1-score for internal medicine dialogues.
Registered a 44.9% increase in F1-score for pediatric interviews.
Outperformed the single best-performing model.

Abstract

Multi-class and multi-label classification of medical dialogues remains a challenging task due to high linguistic variability and transcription noise. This study proposes an ensemble approach based on three fine-tuned Polish T5 (Text-to-Text Transfer Transformer) models trained on partially overlapping clinical dialogue datasets. The models are evaluated exclusively on low-quality, highly noisy, automatically transcribed conversations to assess real-world robustness. The results demonstrate that the ensemble of models improves classification stability and outperforms the best single model, increasing the F1-score by 21.8% for internal medicine dialogues and by 44.9% for paediatric interviews. The proposed method shows potential for practical deployment in clinical decision support and automated medical documentation systems.

Ensemble-Based Multi-Class and Multi-Label Text Classification for Noisy Clinical Dialogues

Key Points

Abstract

Cite This Study