4.7 Article

IoU-uniform R-CNN: Breaking through the limitations of RPN

期刊

PATTERN RECOGNITION
卷 112, 期 -, 页码 -

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2021.107816

关键词

Object detection; Two-stage detector; RPN; IoU distribution imbalance

资金

  1. National Natural Science Foundation of China [61772213, 61976227, 91748204]

向作者/读者索取更多资源

Region Proposal Network (RPN) is crucial in two-stage object detectors for generating object proposals and addressing foreground-background class imbalance during training. However, limitations such as IoU distribution imbalance and inadequate training samples generated by RPN have not fully exploited the detector's potential. To overcome these limitations, a simple but effective method called IoU-Uniform R-CNN is proposed to generate training samples with uniform IoU distribution, improving detection performance and compatibility with various object detection architectures.
Region Proposal Network (RPN) is the cornerstone of two-stage object detectors. It generates a sparse set of object proposals and alleviates the extrem foreground-background class imbalance problem during training. However, we find that the potential of the detector has not been fully exploited due to the IoU distribution imbalance and inadequate quantity of the training samples generated by RPN. With the increasing intersection over union (IoU), the exponentially smaller numbers of positive samples would lead to the distribution skewed towards lower IoUs, which hinders the optimization of detector at high IoU levels. In this paper, to break through the limitations of RPN, we propose IoU-Uniform R-CNN, a simple but effective method that directly generates training samples with uniform IoU distribution for the regression branch as well as the IoU prediction branch. Besides, we improve the performance of IoU prediction branch by eliminating the feature offsets of RoIs at inference, which helps the NMS procedure by preserving accurately localized bounding box. Extensive experiments on the PASCAL VOC and MS COCO dataset show the effectiveness of our method, as well as its compatibility and adaptivity to many object detection architectures. The code is made publicly available at https://github.com/zl1994/IoU-Uniform-R-CNN. (c) 2021 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据