4.7 Article

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING
Volume 28, Issue 1, Pages 265-278

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2018.2867198

Keywords

Object detection; convolutional neural networks; Fisher discrimination criterion; rotation invariance

Funding

  1. National Science Foundation of China [61772425, 61473231, 61790552]
  2. Natural Science Basic Research Plan in Shaanxi Province of China [2017JM6044, 2018KJXX-029]
  3. Aerospace Science Foundation of China [2017ZC53032]
  4. Fundamental Research Funds for the Central Universities [3102018zy023]
  5. Australian Research Council [FT180100116]

Object detection performance has improved significantly in recent years thanks to the powerful features learned by convolutional neural networks (CNNs). Despite this success, several major challenges remain, including object rotation, within-class diversity, and between-class similarity, all of which generally degrade detection performance. To address these issues, we build on existing state-of-the-art object detection systems and propose a simple but effective method for training rotation-invariant and Fisher discriminative CNN models that further boosts detection performance. This is achieved by optimizing a new objective function that explicitly imposes a rotation-invariant regularizer and a Fisher discrimination regularizer on the CNN features. Specifically, the first regularizer enforces that the CNN feature representations of training samples before and after rotation are mapped close to each other, in order to achieve rotation invariance. The second regularizer constrains the CNN features to have small within-class scatter but large between-class separation. We implement the proposed method under four popular object detection frameworks: region-CNN (R-CNN), Fast R-CNN, Faster R-CNN, and R-FCN. In the experiments, we comprehensively evaluate the proposed method on the PASCAL VOC 2007 and 2012 data sets and on a publicly available aerial image data set. The proposed methods outperform the existing baseline methods and achieve state-of-the-art results.
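The two regularizers described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the exact formulas, so everything here is an assumption made for illustration: the function names, the squared-Euclidean form of the rotation term, the "within-class scatter minus between-class scatter" form of the Fisher term, and the weights `lam_rot` and `lam_fisher` are hypothetical and may differ from the paper's actual objective.

```python
import numpy as np


def rotation_invariance_penalty(feats, rotated_feats):
    """Mean squared distance between each sample's feature and the
    features of its rotated copies (assumed form, not from the paper).

    feats:         (N, D) features of the original samples
    rotated_feats: (N, K, D) features of K rotated versions per sample
    """
    diffs = rotated_feats - feats[:, None, :]
    return float(np.mean(np.sum(diffs ** 2, axis=-1)))


def fisher_penalty(feats, labels):
    """Within-class scatter minus between-class scatter (assumed form).

    Smaller values mean tighter classes and better-separated class means.
    """
    overall_mean = feats.mean(axis=0)
    within = 0.0
    between = 0.0
    for c in np.unique(labels):
        class_feats = feats[labels == c]
        class_mean = class_feats.mean(axis=0)
        within += np.sum((class_feats - class_mean) ** 2)
        between += len(class_feats) * np.sum((class_mean - overall_mean) ** 2)
    return float(within - between)


def regularized_objective(task_loss, feats, rotated_feats, labels,
                          lam_rot=0.01, lam_fisher=0.01):
    """Detection loss plus the two weighted penalties (hypothetical weights)."""
    return (task_loss
            + lam_rot * rotation_invariance_penalty(feats, rotated_feats)
            + lam_fisher * fisher_penalty(feats, labels))


# Toy example: 4 samples, 2 classes, 2-D features, 3 rotations per sample.
feats = np.array([[0.0, 0.0], [0.0, 2.0], [4.0, 0.0], [4.0, 2.0]])
labels = np.array([0, 0, 1, 1])
rotated_feats = np.repeat(feats[:, None, :], 3, axis=1)  # perfectly invariant
total = regularized_objective(1.0, feats, rotated_feats, labels,
                              lam_rot=0.5, lam_fisher=0.5)
```

In this toy setup the rotated features are identical to the originals, so the rotation term vanishes and only the Fisher term changes the objective. In an actual detector the features would come from a CNN layer and the penalties would be written in a framework with automatic differentiation, so that they can be minimized jointly with the detection loss.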
