4.6 Article

Deep saliency models : The quest for the loss function

期刊

NEUROCOMPUTING
卷 453, 期 -, 页码 693-704

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2020.06.131

关键词

Deep saliency models; Loss functions; Human visual saliency

向作者/读者索取更多资源

Deep learning techniques are widely used for modeling human visual saliency, with the choice of the loss function being a key factor that can significantly impact results. This study demonstrates that modifying the loss function on a fixed network architecture can lead to improvements or depreciation in performance. Combining several well-chosen loss functions in a linear combination can lead to significant improvements in performance on different datasets and network architectures.
Deep learning techniques are widely used to model human visual saliency, to such a point that state-ofthe-art performances are now only attained by deep neural networks. However, one key part of a typical deep learning model is often neglected when it comes to modeling visual saliency: the choice of the loss function. In this work, we explore some of the most popular loss functions that are used in deep saliency models. We demonstrate that on a fixed network architecture, modifying the loss function can significantly improve (or depreciate) the results, hence emphasizing the importance of the choice of the loss function when designing a model. We also evaluate the relevance of new loss functions for saliency prediction inspired by metrics used in style-transfer tasks. Finally, we show that a linear combination of several well-chosen loss functions leads to significant improvements in performance on different datasets as well as on a different network architecture, thus demonstrating the robustness of a combined metric. CO 2020 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据