4.7 Article

Salient Object Detection Using Cascaded Convolutional Neural Networks and Adversarial Learning

Journal

IEEE TRANSACTIONS ON MULTIMEDIA
Volume 21, Issue 9, Pages 2237-2247

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2019.2900908

Keywords

Salient object detection; cascaded convolutional neural networks; conditional generative adversarial networks; adversarial learning

Funding

  1. Natural Science Foundation of China [61672194]
  2. National Key R&D Program of China [2018YFC0832304]
  3. Distinguished Youth Science Foundation of Heilongjiang Province of China [JC2018021]
  4. Shandong Provincial Natural Science Foundation, China [ZR2016FM04]
  5. Humanity and Social Science Youth Foundation of the Ministry of Education of China [14YJC760001]
  6. Open Foundation of the State Key Laboratory of Robotics and System [SKLRS-2019-KF-14]

Salient object detection has received much attention and achieved great success in the last several years. It remains challenging, however, to obtain clear boundaries and consistent saliency values, which can be regarded as the structural information of salient objects. A popular solution is to apply post-processing (e.g., a conditional random field (CRF)) to refine this structural information. In this paper, a novel method based on cascaded convolutional neural networks (CNNs) is proposed that implicitly learns this structural information via adversarial learning for salient object detection; we term the proposed method CCAL. A cascaded CNN model is first designed as a generator G, which consists of an encoder-decoder network for global saliency estimation and a deep residual network for local saliency refinement. It is hard to learn such structural information explicitly because of the limitations of the commonly used pixel-wise loss functions. Instead, a discriminator D is designed to distinguish the real saliency maps (i.e., ground truths) from the fake ones produced by G, and on this basis an adversarial loss is introduced to optimize G. G and D are trained in a fully end-to-end fashion, following the strategy of conditional generative adversarial networks, so that G learns the structural information well. Finally, G is able to produce high-quality saliency maps that fool D without requiring any post-processing. Experimental results on eight benchmark datasets demonstrate the effectiveness and efficiency (about 17 fps on a graphics processing unit (GPU)) of the proposed method for salient object detection.
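
The interplay between G and D described in the abstract can be illustrated with a minimal sketch of conditional-GAN-style losses. This is not the paper's implementation: the abstract does not give the loss formulas, so the non-saturating adversarial term, the added pixel-wise cross-entropy term, the weight adv_weight, and all function names here are assumptions chosen for illustration. Saliency maps are represented as flat lists of per-pixel probabilities, and d_real / d_fake stand for hypothetical discriminator scores on (image, ground truth) and (image, G(image)) pairs.

```python
import math

def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between probabilities and 0/1 targets."""
    total = 0.0
    for p, t in zip(pred, target):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= t * math.log(p) + (1.0 - t) * math.log(1.0 - p)
    return total / len(pred)

def discriminator_loss(d_real, d_fake):
    """D is pushed to score real pairs as 1 and generated pairs as 0."""
    return bce(d_real, [1.0] * len(d_real)) + bce(d_fake, [0.0] * len(d_fake))

def generator_loss(d_fake, pred_map, gt_map, adv_weight=0.01):
    """Pixel-wise term plus a weighted adversarial term.

    Assumption: combining the two terms and the value of adv_weight are
    illustrative choices, not taken from the paper.
    """
    pixel = bce(pred_map, gt_map)
    adv = bce(d_fake, [1.0] * len(d_fake))  # G is rewarded when D says "real"
    return pixel + adv_weight * adv

# Toy example: a 2x2 saliency map flattened to four pixels.
gt_map = [0.0, 1.0, 1.0, 0.0]
pred_map = [0.1, 0.9, 0.8, 0.2]
print(discriminator_loss(d_real=[0.9], d_fake=[0.2]))
print(generator_loss(d_fake=[0.2], pred_map=pred_map, gt_map=gt_map))
```

In this sketch the generator's loss decreases as the discriminator's score for its output approaches 1, which is the sense in which G learns to "fool" D; in an actual training loop the two losses would be minimized alternately with respect to the parameters of D and G.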
