What question did this study set out to answer?

This research aims to enhance automated brain tumor classification using advanced machine learning techniques.

May 6, 2026Open Access

A Verifiable Framework for Brain Tumor Classification: Combining Vision Transformers, Class-Weighted Learning, and SMT-Based Formal Decision Traces

Key Points

This research aims to enhance automated brain tumor classification using advanced machine learning techniques.
Developed a four-class slice-level classification framework.
Utilized a fine-tuned Swin-Tiny Transformer combined with class-weighted learning and symbolic auditing.
Evaluated architecture under consistent preprocessing, optimization, and evaluation methods.
Achieved 97.42% slice-level accuracy on a clinical dataset with Swin-Tiny.
Demonstrated stability with mean accuracy of 97.40% ± 0.28% through five-fold cross-validation.
Flagged a small percentage of predictions as constraint-violating, reflecting heuristic thresholds rather than model failure.

Abstract

Background/Objectives: Automated brain tumor classification from MRI is particularly challenging when restricted to single post-contrast axial T1-weighted slices without volumetric or clinical context. Methods: We present a four-class (glioma, meningioma, pituitary tumor, no tumor) slice-level classification framework that combines a fine-tuned Swin-Tiny Transformer with inverse-frequency class-weighted learning and a prototype SMT-based symbolic auditing layer for post hoc logical consistency checks. All architectures were trained and evaluated under identical preprocessing, augmentation, optimization, and evaluation protocols. Results: On an internal clinical dataset from Bandırma Onyedi Eylül University Hospital (n = 8040 slices), Swin-Tiny achieved 97.42% slice-level accuracy (macro-F1 97.42%, macro-AUC 0.994), exceeding matched convolutional baselines by approximately eight percentage points. Five-fold stratified cross-validation confirmed stability (mean accuracy 97.40% ± 0.28%). Zero-shot evaluation on the independent BRISC-2025 dataset (n = 6000 slices) yielded 94.82% accuracy and macro-AUC 0.97, indicating maintained performance under acquisition-related distribution shift. Per-class metrics were consistently high across tumor types, with residual errors dominated by glioma–meningioma confusion, reflecting known radiologic overlap on single contrast-enhanced T1 slices. The symbolic auditing layer flagged 1.2–2.9% of predictions as constraint-violating; most such cases were borderline but correctly classified, suggesting sensitivity of heuristic thresholds rather than systematic model failure. Conclusions: These findings support the value of hierarchical shifted-window attention for integrating local texture and broader spatial context in slice-level MRI classification. While patient-wise, multimodal, and prospective validation remain necessary for clinical deployment, this study provides a controlled empirical benchmark and a prototype mechanism for post hoc logical auditing in neuro-oncologic imaging.

Read Full Paperexternally

Demander à l'IA

Bookmark

View Full Paper