What does this research mean for the field?

Vision Transformer (ViT)-based foundation models, such as UNI, Virchow2, and GigaPath, demonstrate the highest performance and best generalization across different scanning devices for content-based image retrieval in histopathology. Novelty: ClaimNovelty.INCREMENTAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

May 29, 2026Open Access

Reliability of foundation models for image retrieval in histopathology

Key Points

This study aims to explore covariate bias due to differences in scanning devices within foundation models for image retrieval in histopathology.
Introduced a dataset of spatially co-registered images from histopathology slides scanned by two different scanners.
Conducted a targeted analysis of scanner-induced variability in representations of foundation models.
Assessed performance of Vision Transformer-based architectures including UNI, Virchow2, and GigaPath.
Vision Transformer-based models showed superior performance and generalization across different scanners.
Identified significant covariate bias linked to scanning device differences affecting image retrieval effectiveness.

Abstract

Ensuring fairness and explainability is essential for the development of ethical, reliable, and effective AI systems in healthcare. Bias in AI models can contribute to disparities in clinical outcomes, challenging equity in medical decision-making. Content-Based Image Retrieval (CBIR) offers interpretable, visual tools to support diagnostic processes; however, these tools remain susceptible to biases inherent in the data. This study investigates covariate bias arising from differences in scanning devices within Foundation Models (FMs) used for CBIR in histopathology. We introduce a unique dataset comprising spatially co-registered images derived from the same histopathology slides scanned using two distinct scanners. This design enables a targeted analysis of scanner-induced variability in FM representations. Among the FMs assessed, Vision Transformer (ViT)-based architectures such as UNI, Virchow2, and GigaPath, demonstrated top performance and best generalization properties across scanners.

Bookmark

View Full Paper

Cite This Study

Shafique et al. (Tue,) studied this question.

synapsesocial.com/papers/6a192c8bfab5b468c44155a9 https://doi.org/https://doi.org/10.1038/s44303-026-00174-7

Bookmark

View Full Paper