4.7 Article

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING
Volume 28, Issue 1, Pages 265-278

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2018.2867198

Keywords

Object detection; convolutional neural networks; Fisher discrimination criterion; rotation invariance

Funding

  1. National Science Foundation of China [61772425, 61473231, 61790552]
  2. Natural Science Basic Research Plan in Shaanxi Province of China [2017JM6044, 2018KJXX-029]
  3. Aerospace Science Foundation of China [2017ZC53032]
  4. Fundamental Research Funds for the Central Universities [3102018zy023]
  5. Australian Research Council [FT180100116]

Ask authors/readers for more resources

The performance of object detection has recently been significantly improved due to the powerful features learnt through convolutional neural networks (CNNs). Despite the remarkable success, there are still several major challenges in object detection, including object rotation, within-class diversity, and between-class similarity, which generally degenerate object detection performance. To address these issues, we build up the existing state-of-the-art object detection systems and propose a simple but effective method to train rotation-invariant and Fisher discriminative CNN models to further boost object detection performance. This is achieved by optimizing a new objective function that explicitly imposes a rotation-invariant regularizer and a Fisher discrimination regularizer on the CNN features. Specifically, the first regularizer enforces the CNN feature representations of the training samples before and after rotation to be mapped closely to each other in order to achieve rotation-invariance. The second regularizer constrains the CNN features to have small within-class scatter but large between-class separation. We implement our proposed method under four popular object detection frameworks, including region-CNN (R-CNN), Fast R-CNN, Faster R-CNN, and R-FCN. In the experiments, we comprehensively evaluate the proposed method on the PASCAL VOC 2007 and 2012 data sets and a publicly available aerial image data set. Our proposed methods outperform the existing baseline methods and achieve the state-of-the-art results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available