While convolutional neural networks (CNNs) excel at image classification, their generalization across domains and robustness to nonlinear degradation remain challenges in art media classification (AMC). To address these challenges, this article presents a dual-stage analytical framework: first, an evaluation of seven discrete CNN architectures—ranging from VGG16 to ConvNeXt—subjected to domain shift using the New Spain (Mexico) Art Media Dataset; and second, a formal robustness analysis using an artistic corruption benchmark (Art-C). This benchmark simulates nonlinear degradations, including cracking, oxidized varnish, and pictorial abstraction. Our results demonstrate that while deep convolutional representations maintain acceptable transferability (accuracy >70%), significant variability exists in architectural stability (mean 0.0607) under progressive stochastic degradation. Notably, Xception exhibited the highest robustness (Art-mCE = 0.8039), whereas VGG16 showed the greatest relative performance decay. Severity analysis further indicates that structural perturbations induce higher error rates than chromatic shifts, suggesting that CNNs are more sensitive to topological features (depth and residual connections) than color-space distributions. We provide quantitative evidence characterizing the relationship between architectural topology and empirical stability in non-natural image domains.
Fortuna-Cervantes et al. (Thu,) studied this question.