4.4 Article

Grad-CAM helps interpret the deep learning models trained to classify multiple sclerosis types using clinical brain magnetic resonance imaging

Journal

JOURNAL OF NEUROSCIENCE METHODS
Volume 353, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.jneumeth.2021.109098

Keywords

Deep learning; Convolutional neural network; Model interpretation; Magnetic resonance imaging; Heatmap; Class activation mapping; CAM; Gradient-GAM; Disease type; Grad-CAM plus plus; Multiple sclerosis

Funding

  1. MS Society of Canada
  2. Calgary Clinical MS Research Program

Ask authors/readers for more resources

This study investigated three heatmap-generating techniques for CNN interpretation, with Grad-CAM showing the best ability in heatmap localization. Through training and examining six different models, the impact of CNNs on heatmap generation was demonstrated. By analyzing the performance of the VGG19 model, differences in the brain regions between SPMS and RRMS were identified.
Background: Deep learning using convolutional neural networks (CNNs) has shown great promise in advancing neuroscience research. However, the ability to interpret the CNNs lags far behind, confounding their clinical translation. New method: We interrogated 3 heatmap-generating techniques that have increasing generalizability for CNN interpretation: class activation mapping (CAM), gradient (Grad)-CAM, and Grad-CAM++. To investigate the impact of CNNs on heatmap generation, we also examined 6 different models trained to classify brain magnetic resonance imaging into 3 types: relapsing-remitting multiple sclerosis (RRMS), secondary progressive MS (SPMS), and control. Further, we designed novel methods to visualize and quantify the heatmaps to improve interpretability. Results: Grad-CAM showed the best heatmap localizing ability, and CNNs with a global average pooling layer and pretrained weights had the best classification performance. Based on the best-performing CNN model, called VGG19, the 95th percentile values of Grad-CAM in SPMS were significantly higher than RRMS, indicating greater heterogeneity. Further, voxel-wise analysis of the thresholded Grad-CAM confirmed the difference identified visually between RRMS and SPMS in discriminative brain regions: occipital versus frontal and occipital, or temporal/parietal. Comparison with existing methods: No study has examined the CAM methods together using clinical images. There is also lack of study on the impact of CNN architecture on heatmap outcomes, and of technologies to quantify heatmap patterns in clinical settings. Conclusions: Grad-CAM outperforms CAM and Grad-CAM++. Integrating Grad-CAM, novel heatmap quantification approaches, and robust CNN models may be an effective strategy in identifying the most crucial brain areas underlying disease development in MS.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available