4.8 Article

SIGN: Statistical Inference Graphs Based on Probabilistic Network Activity Interpretation

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2022.3181472

Keywords

Visualization; Hidden Markov models; Training; Neural networks; Probabilistic logic; Convolutional neural networks; Network architecture; Deep neural networks; convolutional neural networks; visualization; interpretable AI; explainable AI; XAI; statistical inference

Ask authors/readers for more resources

Convolutional neural networks (CNNs) have achieved high accuracy in visual-related tasks, but interpreting their intermediate layers and understanding their decision-making process is challenging. The SIGN method introduces probabilistic models to model the hidden layer activity in CNNs. The models estimate transition probabilities between clusters in consecutive layers and identify paths of inference. These inference graphs help understand the overall inference process and explain the network's decisions about specific images.
Convolutional neural networks (CNNs) have achieved superior accuracy in many visual-related tasks. However, the inference process through a CNN's intermediate layers is opaque, making it difficult to interpret such networks or develop trust in their operation. In this article, we introduce SIGN method for modeling the network's hidden layer activity using probabilistic models. The activity patterns in layers of interest are modeled as Gaussian mixture models, and transition probabilities between clusters in consecutive modeled layers are estimated to identify paths of inference. For fully connected networks, the entire layer activity is clustered, and the resulting model is a hidden Markov model. For convolutional layers, spatial columns of activity are clustered, and a maximum likelihood model is developed for mining an explanatory inference graph. The graph describes the hierarchy of activity clusters most relevant for network prediction. We show that such inference graphs are useful for understanding the general inference process of a class, as well as explaining the (correct or incorrect) decisions the network makes about specific images. In addition, SIGN provide interesting observations regarding hidden layer activity in general, including the concentration of memorization in a single middle layer in fully connected networks, and a highly local nature of column activities in the top CNN layers.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available