☆ 4.5 Article

Corpus-level and Concept-based Explanations for Interpretable Document Classification

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2022)

期刊

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

卷 16, 期 3, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3477539

关键词

Attention mechanism; model interpretation; document classification; sentiment classification; concept-based explanation

类别

Computer Science, Information Systems Computer Science, Software Engineering

资金

US National Science Foundation [IIS-1707498, IIS-1838730]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study introduces an explanation method that captures causal relationships between keywords and model predictions by learning the importance of keywords for predicted labels across a training corpus based on attention weights. It can automatically learn higher-level concepts and their importance to model prediction tasks.

Using attention weights to identify information that is important for models' decision making is a popular approach to interpret attention-based neural networks. This is commonly realized in practice through the generation of a heat-map for every single document based on attention weights. However, this interpretation method is fragile and it is easy to find contradictory examples. In this article, we propose a corpus-level explanation approach, which aims at capturing causal relationships between keywords and model predictions via learning the importance of keywords for predicted labels across a training corpus based on attention weights. Based on this idea, we further propose a concept-based explanation method that can automatically learn higher level concepts and their importance to model prediction tasks. Our concept-based explanation method is built upon a novel Abstraction-Aggregation Network (AAN), which can automatically cluster important keywords during an end-to-end training process. We apply these methods to the document classification task and show that they are powerful in extracting semantically meaningful keywords and concepts. Our consistency analysis results based on an attention-based Naive Bayes classifier (NBC) also demonstrate that these keywords and concepts are important for model predictions.

Corpus-level and Concept-based Explanations for Interpretable Document Classification

期刊

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Corpus-level and Concept-based Explanations for Interpretable Document Classification

期刊

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文