What question did this study set out to answer?

April 18, 2026 | Open Access

Secure yet fragile: adversarial vulnerabilities of federated vision–language models in medical AI

Key Points

The study assesses the adversarial vulnerabilities of federated vision–language models in medical image analysis.
It evaluates CLIP-based vision–language models trained with four federated optimization strategies: FedAvg, FedProx, FedPer, and FedBN.
Robustness is tested against FGSM, PGD, BIM, and MI-FGSM attacks at varying strengths.
Two training-free test-time defenses, Test-Time Counter-Attack (TTC) and CLIPure, are compared.
Adversarial perturbations caused severe accuracy degradation in federated models.
Attack success rates were high, especially under iterative attacks.
Both defenses mitigated adversarial effects, with CLIPure providing more consistent improvements across datasets and attack intensities.
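
To make the difference between single-step and iterative attacks concrete, the following is a minimal sketch of FGSM and BIM. It assumes a toy logistic-regression classifier rather than the CLIP-based models evaluated in the study, and the names `predict`, `input_gradient`, `fgsm`, and `bim` are illustrative, not taken from the paper. FGSM takes one step of size `eps` along the sign of the loss gradient; BIM takes several smaller steps of size `alpha` and clips the result back into the `eps`-ball around the original input.

```python
import math

def predict(w, b, x):
    """Probability of class 1 under a toy logistic model (assumed stand-in)."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))

def input_gradient(w, b, x, y):
    """Gradient of binary cross-entropy with respect to the input x."""
    p = predict(w, b, x)
    return [(p - y) * wi for wi in w]

def sign(v):
    """Return -1, 0, or 1 depending on the sign of v."""
    return (v > 0) - (v < 0)

def fgsm(w, b, x, y, eps):
    """Single-step attack: move every feature by eps along the gradient sign."""
    g = input_gradient(w, b, x, y)
    return [xi + eps * sign(gi) for xi, gi in zip(x, g)]

def bim(w, b, x, y, eps, alpha, steps):
    """Iterative attack: repeated small steps, clipped to the eps-ball around x."""
    x_adv = list(x)
    for _ in range(steps):
        g = input_gradient(w, b, x_adv, y)
        x_adv = [xa + alpha * sign(gi) for xa, gi in zip(x_adv, g)]
        # Keep the total perturbation within eps of the original input.
        x_adv = [min(max(xa, xo - eps), xo + eps) for xa, xo in zip(x_adv, x)]
    return x_adv

if __name__ == "__main__":
    w, b = [2.0, -1.0], 0.0      # hypothetical model parameters
    x, y = [1.0, 0.5], 1         # hypothetical input with true label 1
    print("clean:", predict(w, b, x))
    print("FGSM :", predict(w, b, fgsm(w, b, x, y, eps=0.5)))
    print("BIM  :", predict(w, b, bim(w, b, x, y, eps=0.5, alpha=0.1, steps=10)))
```

On a linear model like this one the gradient sign never changes, so both attacks end up at essentially the same point; the two only diverge on nonlinear models, where re-computing the gradient at each step lets the iterative attack follow the loss surface. PGD is commonly described as the same loop with a random starting point inside the `eps`-ball, and MI-FGSM as the same loop with a momentum term on the gradient; neither is shown here.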

Abstract

Vision–Language Models (VLMs) enable powerful multimodal reasoning for medical image analysis, while federated learning allows collaborative training across institutions without sharing patient data. However, the adversarial robustness of federated medical VLMs remains largely unexplored. This work systematically evaluates the vulnerability of CLIP-based VLMs trained with four federated optimization strategies (FedAvg, FedProx, FedPer, and FedBN) on multiple medical datasets. We assess robustness under FGSM, PGD, BIM, and MI-FGSM attacks at varying strengths and show that client-level adversarial perturbations propagate through federated aggregation, causing severe accuracy degradation and high attack success rates, especially under iterative attacks. We further benchmark two training-free test-time defenses, Test-Time Counter-Attack (TTC) and CLIPure, and demonstrate that both mitigate adversarial effects, with CLIPure providing more consistent improvements across datasets and attack intensities. These results highlight fundamental robustness limitations of federated medical VLMs and underscore the need for effective defense mechanisms in distributed clinical deployments.
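
As a rough illustration of how a client-level change can reach the shared model, here is a minimal sketch of FedAvg-style aggregation, in which the server averages client parameters weighted by local sample counts. The function name `fedavg`, the flat-list parameter representation, and the client values are assumptions made for this example; the abstract does not describe the study's actual aggregation code or experimental setup.

```python
def fedavg(client_weights, client_sizes):
    """Sample-size-weighted average of client parameter vectors.

    client_weights: list of equal-length lists of floats, one per client.
    client_sizes:   number of local training samples per client.
    """
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]

if __name__ == "__main__":
    sizes = [10, 10, 20]                     # hypothetical client dataset sizes
    clean = [[0.0, 1.0], [0.0, 1.0], [0.0, 1.0]]
    # Same clients, but the third one's parameters are shifted (hypothetical).
    shifted = [[0.0, 1.0], [0.0, 1.0], [1.0, 0.0]]
    print("global (clean)  :", fedavg(clean, sizes))
    print("global (shifted):", fedavg(shifted, sizes))
```

In this sketch a shift in one client's parameters enters the global model scaled by that client's share of the total samples, so the averaging dilutes the shift but does not remove it. The other strategies named in the abstract differ in what is aggregated or how clients train locally (FedProx is usually described as adding a proximal term to the local objective, FedPer as keeping personalization layers local, and FedBN as keeping batch-normalization parameters local), and none of those variants is modeled here.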

Fime et al. studied this question.

synapsesocial.com/papers/69e3205140886becb653f6fe https://doi.org/10.1038/s41598-026-48102-4
