☆ 4.8 Article

SIGN: Statistical Inference Graphs Based on Probabilistic Network Activity Interpretation

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Journal

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

Volume 45, Issue 3, Pages 3783-3797

Publisher

IEEE COMPUTER SOC

DOI: 10.1109/TPAMI.2022.3181472

Keywords

Visualization; Hidden Markov models; Training; Neural networks; Probabilistic logic; Convolutional neural networks; Network architecture; Deep neural networks; convolutional neural networks; visualization; interpretable AI; explainable AI; XAI; statistical inference

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Convolutional neural networks (CNNs) have achieved high accuracy in visual-related tasks, but interpreting their intermediate layers and understanding their decision-making process is challenging. The SIGN method introduces probabilistic models to model the hidden layer activity in CNNs. The models estimate transition probabilities between clusters in consecutive layers and identify paths of inference. These inference graphs help understand the overall inference process and explain the network's decisions about specific images.

Convolutional neural networks (CNNs) have achieved superior accuracy in many visual-related tasks. However, the inference process through a CNN's intermediate layers is opaque, making it difficult to interpret such networks or develop trust in their operation. In this article, we introduce SIGN method for modeling the network's hidden layer activity using probabilistic models. The activity patterns in layers of interest are modeled as Gaussian mixture models, and transition probabilities between clusters in consecutive modeled layers are estimated to identify paths of inference. For fully connected networks, the entire layer activity is clustered, and the resulting model is a hidden Markov model. For convolutional layers, spatial columns of activity are clustered, and a maximum likelihood model is developed for mining an explanatory inference graph. The graph describes the hierarchy of activity clusters most relevant for network prediction. We show that such inference graphs are useful for understanding the general inference process of a class, as well as explaining the (correct or incorrect) decisions the network makes about specific images. In addition, SIGN provide interesting observations regarding hidden layer activity in general, including the concentration of memorization in a single middle layer in fully connected networks, and a highly local nature of column activities in the top CNN layers.

SIGN: Statistical Inference Graphs Based on Probabilistic Network Activity Interpretation

Journal

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

SIGN: Statistical Inference Graphs Based on Probabilistic Network Activity Interpretation

Journal

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper