4.6 Review

Black-box error diagnosis in Deep Neural Networks for computer vision: a survey of tools

Journal

NEURAL COMPUTING & APPLICATIONS
Volume 35, Issue 4, Pages 3041-3062

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s00521-022-08100-9

Keywords

Black-box; Error diagnosis; Machine learning; Evaluation; Metrics

The application of DNNs to various tasks requires methods to deal with their complex and opaque nature. Evaluating performance beyond standard metrics, understanding model behavior, and diagnosing prediction errors can be achieved through model interpretation and black-box error diagnosis techniques. Both approaches provide insights for improving the architecture and training process.
The application of Deep Neural Networks (DNNs) to a broad variety of tasks demands methods for coping with the complex and opaque nature of these architectures. When a gold standard is available, performance assessment treats the DNN as a black box and computes standard metrics by comparing the predictions with the ground truth. A deeper understanding of performance requires going beyond such evaluation metrics to diagnose the model behavior and the prediction errors. This goal can be pursued in two complementary ways. On the one hand, model interpretation techniques open the box and assess the relationship between the input, the inner layers and the output, so as to identify the architecture modules most likely to cause the performance loss. On the other hand, black-box error diagnosis techniques study the correlation between the model response and some properties of the input not used for training, so as to identify the features of the inputs that make the model fail. Both approaches give hints on how to improve the architecture and/or the training process. This paper focuses on the application of DNNs to computer vision (CV) tasks and presents a survey of the tools that support the black-box performance diagnosis paradigm. It illustrates the features and gaps of the current proposals, discusses the relevant research directions and provides a brief overview of the diagnosis tools in sectors other than CV.
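As a rough illustration of the black-box diagnosis idea described in the abstract, the sketch below groups predictions by an input property that was not used for training and compares the per-group accuracy with the overall figure. It is not taken from the paper or from any of the surveyed tools: the function name `accuracy_by_property`, the `brightness` attribute and the toy labels are all assumptions made up for this example.

```python
from collections import defaultdict


def accuracy_by_property(predictions, ground_truth, properties):
    """Group samples by a property value and compute accuracy per group.

    The model is treated as a black box: only its predictions, the ground
    truth, and an extra per-sample property (hypothetical here) are used.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for pred, truth, prop in zip(predictions, ground_truth, properties):
        totals[prop] += 1
        hits[prop] += int(pred == truth)
    return {prop: hits[prop] / totals[prop] for prop in totals}


# Hypothetical toy data: class predictions, gold labels, and an image
# property (brightness) that was not part of the training signal.
preds = ["cat", "dog", "cat", "dog", "cat", "dog"]
truth = ["cat", "dog", "dog", "dog", "dog", "cat"]
brightness = ["bright", "bright", "dark", "bright", "dark", "dark"]

# Standard metric: a single number that hides where the errors come from.
overall = sum(p == t for p, t in zip(preds, truth)) / len(truth)

# Diagnosis: the same metric broken down by the input property.
per_group = accuracy_by_property(preds, truth, brightness)

print(overall)
print(per_group)
```

In this toy data the overall accuracy alone says nothing about the cause of the errors, whereas the breakdown shows that every failure falls on the dark images, which is the kind of hint the abstract refers to.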
