☆ 4.7 Article

Coarse-to-Fine Description for Fine-Grained Visual Categorization

IEEE TRANSACTIONS ON IMAGE PROCESSING (2016)

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

卷 25, 期 10, 页码 4858-4872

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIP.2016.2599102

关键词

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

National High Technology Research and Development Program of China [2014AA015202]
National Nature Science Foundation of China [61525206, 61428207, 61572050, 91538111, 61429201]
Beijing Advanced Innovation Center for Imaging Technology [BAICIT-2016009]
ARO [W911NF-15-1-0290]
Faculty Research Gift Awards by NEC Laboratories of America
Blippar

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Recent years have witnessed the significant advance in fine-grained visual categorization, which targets to classify the objects belonging to the same species. To capture enough subtle visual differences and build discriminative visual description, most of the existing methods heavily rely on the artificial part annotations, which are expensive to collect in real applications. Motivated to conquer this issue, this paper proposes a multilevel coarse-to-fine object description. This novel description only requires the original image as input, but could automatically generate visual descriptions discriminative enough for fine-grained visual categorization. This description is extracted from five sources representing coarse-to-fine visual clues: 1) original image is used as the source of global visual clue; 2) object bounding boxes are generated using convolutional neural network (CNN); 3) with the generated bounding box, foreground is segmented using the proposed k nearest neighbour-based co-segmentation algorithm; and 4) two types of part segmentations are generated by dividing the foreground with an unsupervised part learning strategy. The final description is generated by feeding these sources into CNN models and concatenating their outputs. Experiments on two public benchmark data sets show the impressive performance of this coarse-to-fine description, i.e., classification accuracy achieves 82.5% on CUB-200-2011, and 86.9% on fine-grained visual categorization-Aircraft, respectively, which outperform many recent works.

Coarse-to-Fine Description for Fine-Grained Visual Categorization

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Coarse-to-Fine Description for Fine-Grained Visual Categorization

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文