4.7 Article

Explainable AI to improve acceptance of convolutional neural networks for automatic classification of dopamine transporter SPECT in the diagnosis of clinically uncertain parkinsonian syndromes

Journal

Publisher

SPRINGER
DOI: 10.1007/s00259-021-05569-9

Keywords

Convolutional neural network; Explainable AI; Relevance propagation; Parkinson's disease; Dopamine transporter; SPECT

Funding

  1. European Union Horizon 2020 research and innovation program under the Marie Skodowska-Curie grant [764458]

Ask authors/readers for more resources

This study tested layer-wise relevance propagation (LRP) to explain the CNN-based classification of dopamine transporter (DAT) SPECT images in patients with clinically uncertain parkinsonian syndromes. The results showed that LRP provided relevant maps that were easy to interpret and highlighted the key brain regions for CNN-based classification. Overall, LRP is recommended to support CNN-based classification of DAT-SPECT in clinical routine, with a total computation time of 3 seconds which is compatible with busy clinical workflow. The utility of inconsistent relevance maps to identify misclassified cases requires further investigation.
Purpose Deep convolutional neural networks (CNN) provide high accuracy for automatic classification of dopamine transporter (DAT) SPECT images. However, CNN are inherently black-box in nature lacking any kind of explanation for their decisions. This limits their acceptance for clinical use. This study tested layer-wise relevance propagation (LRP) to explain CNN-based classification of DAT-SPECT in patients with clinically uncertain parkinsonian syndromes. Methods The study retrospectively included 1296 clinical DAT-SPECT with visual binary interpretation as normal or reduced by two experienced readers as standard-of-truth. A custom-made CNN was trained with 1008 randomly selected DAT-SPECT. The remaining 288 DAT-SPECT were used to assess classification performance of the CNN and to test LRP for explanation of the CNN-based classification. Results Overall accuracy, sensitivity, and specificity of the CNN were 95.8%, 92.8%, and 98.7%, respectively. LRP provided relevance maps that were easy to interpret in each individual DAT-SPECT. In particular, the putamen in the hemisphere most affected by nigrostriatal degeneration was the most relevant brain region for CNN-based classification in all reduced DAT-SPECT. Some misclassified DAT-SPECT showed an inconsistent relevance map more typical for the true class label. Conclusion LRP is useful to provide explanation of CNN-based decisions in individual DAT-SPECT and, therefore, can be recommended to support CNN-based classification of DAT-SPECT in clinical routine. Total computation time of 3 s is compatible with busy clinical workflow. The utility of inconsistent relevance maps to identify misclassified cases requires further investigation.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available