Key points are not available for this paper at this time.
The interpretability of the convolutional neural networks(CNNs) has become a research hotspot. A popular explanation method is based on Class Activation Mapping (CAM), which visualizes the salient regions most relevant to neural network decisions. However, many CAM methods use the feature maps produced by the final convolution layer to generate the class activation maps, which usually have a low spatial resolution and can only generate coarse-grained visual explanations that provide a rough spatial location of the target object. In this paper, we propose a novel CAM method named Fusion-CAM. It improves traditional CAM methods by combining final class activation map containing semantic information with intermediate layer class activation maps containing fine-grained details, to generate fine-grained visual explanations with high faithfulness. In order to obtain high-quality intermediate layer class activation maps, we utilize Layer-wise Relevance Propagation (LRP) to obtain the weighting components of each channel of the intermediate layer feature maps, and the intermediate layer class activation maps generated by weighted summation are less noisy and have clear fine-grained details, which help to improve the quality of the final class activation map. Qualitative and quantitative experiments show that Fusion-CAM can be easily attached to different CAM methods to improve their performance.
Chen et al. (Fri,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: