Computer vision is being revolutionized by the use of transformer-based machine learning architectures. However, these models need large datasets to enable pre-training through self-supervised learning. However, there is a lack of open-source datasets of the same magnitude as standard RGB color images. This work analyzes the effect of using randomly generated fractal-based hyperspectral images versus real data to understand the effect of pre-training dataset on a Swin image encoder model performance, during supervised-training of a semantic segmentation hyperspectral dataset. Two real data datasets are used for comparison to the synthetic dataset, one RGB-based and another hyperspectral-based to understand how variability in spectral resolution during pre-training effects model performance on semantic segmentation.
Medellin et al. (Tue,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: