4.7 Article

High-Quality Proposals for Weakly Supervised Object Detection

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING
卷 29, 期 -, 页码 5794-5804

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2020.2987161

关键词

Proposals; Training; Detectors; Object detection; Search problems; Task analysis; Convolutional neural networks; Weakly supervised object detection (WSOD); proposal generation; proposal selection; convolutional neural networks (CNNs)

资金

  1. National Science Foundation of China [61772425, 61773315, 61790552, U1801265]
  2. Key RAMP
  3. D Program of Guangdong Province [2019B010110001]
  4. Fundamental Research Funds for the Central Universities [3102019AX09]
  5. Research Funds for Interdisciplinary subject, NWPU

向作者/读者索取更多资源

Despite significant efforts made so far for Weakly Supervised Object Detection (WSOD), proposal generation and proposal selection are still two major challenges. In this paper, we focus on addressing the two challenges by generating and selecting high-quality proposals. To be specific, for proposal generation, we combine selective search and a Gradient-weighted Class Activation Mapping (Grad-CAM) based technique to generate more proposals having higher Intersection-Over-Union (IOU) with ground truth boxes than those obtained by greedy search approaches, which can better envelop the entire objects. As regards proposal selection, for each object class, we choose as many confident positive proposals as possible and meanwhile only select class-specific hard negatives to focus training on more discriminative negative proposals by up-weighting their losses, which can make training more effective. The proposed proposal generation and proposal selection approaches are generic and thus can be broadly applied to many WSOD methods. In this work, we unify them into the framework of Online Instance Classifier Refinement (OICR). Experimental results on the PASCAL VOC 2007 and 2012 datasets and MS COCO dataset demonstrate that our method significantly improves the baseline method OICR by large margins (13.4% mAP and 11.6% CorLoc gains on the VOC 2007 dataset, 15.0% mAP and 8.9% CorLoc gains on the VOC 2012 dataset, and 6.4% mAP and 5.0% CorLoc gains on the COCO dataset) and achieves the state-of-the-art results compared with existing methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据