☆ 4.7 Review

Interpreting Adversarial Examples in Deep Learning: A Review

ACM COMPUTING SURVEYS (2023)

期刊

ACM COMPUTING SURVEYS

卷 55, 期 14S, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3594869

关键词

Deep learning; adversarial example; interpretability; adversarial robustness

类别

Computer Science, Theory & Methods

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This article presents a framework that discusses recent works on theoretically explaining adversarial examples from three perspectives, instead of reviewing technical progress in adversarial attacks and defenses. Drawing on reviewed literature, this survey identifies current problems and challenges and highlights potential future research directions in investigating adversarial examples.

Deep learning technology is increasingly being applied in safety-critical scenarios but has recently been found to be susceptible to imperceptible adversarial perturbations. This raises a serious concern regarding the adversarial robustness of deep neural network (DNN)-based applications. Accordingly, various adversarial attacks and defense approaches have been proposed. However, current studies implement different types of attacks and defenses with certain assumptions. There is still a lack of full theoretical understanding and interpretation of adversarial examples. Instead of reviewing technical progress in adversarial attacks and defenses, this article presents a framework consisting of three perspectives to discuss recent works focusing on theoretically explaining adversarial examples comprehensively. In each perspective, various hypotheses are further categorized and summarized into several subcategories and introduced systematically. To the best of our knowledge, this study is the first to concentrate on surveying existing research on adversarial examples and adversarial robustness from the interpretability perspective. By drawing on the reviewed literature, this survey characterizes current problems and challenges that need to be addressed and highlights potential future research directions to further investigate adversarial examples.

Interpreting Adversarial Examples in Deep Learning: A Review

期刊

ACM COMPUTING SURVEYS

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Interpreting Adversarial Examples in Deep Learning: A Review

期刊

ACM COMPUTING SURVEYS

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文