☆ 4.7 Article

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

卷 28, 期 1, 页码 265-278

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIP.2018.2867198

关键词

Object detection; convolutional neural networks; Fisher discrimination criterion; rotation invariance

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

National Science Foundation of China [61772425, 61473231, 61790552]
Natural Science Basic Research Plan in Shaanxi Province of China [2017JM6044, 2018KJXX-029]
Aerospace Science Foundation of China [2017ZC53032]
Fundamental Research Funds for the Central Universities [3102018zy023]
Australian Research Council [FT180100116]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The performance of object detection has recently been significantly improved due to the powerful features learnt through convolutional neural networks (CNNs). Despite the remarkable success, there are still several major challenges in object detection, including object rotation, within-class diversity, and between-class similarity, which generally degenerate object detection performance. To address these issues, we build up the existing state-of-the-art object detection systems and propose a simple but effective method to train rotation-invariant and Fisher discriminative CNN models to further boost object detection performance. This is achieved by optimizing a new objective function that explicitly imposes a rotation-invariant regularizer and a Fisher discrimination regularizer on the CNN features. Specifically, the first regularizer enforces the CNN feature representations of the training samples before and after rotation to be mapped closely to each other in order to achieve rotation-invariance. The second regularizer constrains the CNN features to have small within-class scatter but large between-class separation. We implement our proposed method under four popular object detection frameworks, including region-CNN (R-CNN), Fast R-CNN, Faster R-CNN, and R-FCN. In the experiments, we comprehensively evaluate the proposed method on the PASCAL VOC 2007 and 2012 data sets and a publicly available aerial image data set. Our proposed methods outperform the existing baseline methods and achieve the state-of-the-art results.

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文