Accurate classification of brain tumors from MRI scans remains challenging due to limited annotated data. This study compares data-efficient paradigms—few-shot learning (FSL) and zero-shot learning (ZSL)—for tumor diagnosis using deep learning and vision–language models. A Prototypical Network (ProtoNet) with CNN, ResNet-18, and vision transformer backbones was evaluated under 1000 randomly sampled five-shot, four-way episodes (mean ± SD). The ResNet-18 ProtoNet achieved 85% ± 8% accuracy (F1 = 0.85), surpassing a fine-tuned ResNet-50 baseline (42% ± 12%) and the CLIP (ZSL) model (30% ± 10%). A visual-only ZSL baseline without text guidance achieved 54% ± 11%. These results highlight that metric-based FSL offers 43% absolute improvement over standard fine-tuning and establishes a robust benchmark for data-efficient MRI classification under severe label constraints.
Aparna Das (Tue,) studied this question.