March 3, 2026Open Access

Stylized Explanations: Enhancing Neural Networks Visual Interpretations Using Neural Style Transfer

Key Points

Stylized activation maps improved object localization, achieving higher IoU values compared to the input image activation map.
The approach used neural style transfer to create artistic versions of input images for better visual interpretation.
Experiments showed the input image activation maps outperformed stylized ones in faithfulness with lower confidence drop.
Sensitivity to VGG-16 model randomization was observed in both input and stylized activation maps during sanity checks.

Abstract

Neural networks have made significant achievements in areas like computer vision and natural language processing. However, there is an increasing demand to understand their outputs and decisions. They are considered black box models because it is hard to understand their complex architecture as they lack justification and transparency. In this paper, we present a novel approach by generating stylized explanations using neural style transfer. Unlike methods that use one input image to generate activation maps, we adopt the neural style transfer technique to generate artistic styles of the input image. We select eight styles from the STaDA experiments, “laₘuse”, “rainₚrincess”, “sunflower”, “theₛcream”, “theₛhipwreck”, “udnie”, “wave”, and “yourₙame”. We pass the stylized images with the input image to Score-CAM and produce class activation maps for the nine images, the input image and the eight stylized images. We evaluate the stylized activation maps by conducting experiments like faithfulness, object localization, and sanity check. The input image activation map outperformed stylized activation maps in terms of faithfulness with a lower confidence drop. However, in terms of object localization, “sunflower”, “theₛcream”, “theₛhipwreck”, “udnie”, “wave”, and “yourₙame” stylized activation maps outperformed the input image activation map with a higher IoU value. In terms of sanity check, both the input image activation map and stylized activation maps were sensitive to VGG-16 model randomization.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Rami Ibrahim (Thu,) studied this question.

synapsesocial.com/papers/69a75d15c6e9836116a268be https://doi.org/https://doi.org/10.1109/access.2026.3658548

Bookmark

View Full Paper