☆ 4.6 Article

Pooling in image representation: The visual codeword point of view

COMPUTER VISION AND IMAGE UNDERSTANDING (2013)

期刊

COMPUTER VISION AND IMAGE UNDERSTANDING

卷 117, 期 5, 页码 453-465

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.cviu.2012.09.007

关键词

Image classification; Image representation; Pattern recognition; Bag-of-Words; Visual dictionary; Coding; Pooling; SVM

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

CAPES/COFECUB [592/08/10]
CNPq [14.1312/2009-2]
ANR [07-MDCO-007-03]
FAPESP

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

In this work, we propose Bossallova, a novel representation for content-based concept detection in images and videos, which enriches the Bag-of-Words model. Relying on the quantization of highly discriminant local descriptors by a codebook, and the aggregation of those quantized descriptors into a single pooled feature vector, the Bag-of-Words model has emerged as the most promising approach for concept detection on visual documents. Bossallova enhances that representation by keeping a histogram of distances between the descriptors found in the image and those in the codebook, preserving thus important information about the distribution of the local descriptors around each codeword. Contrarily to other approaches found in the literature, the non-parametric histogram representation is compact and simple to compute. Bossallova compares well with the state-of-the-art in several standard datasets: MIRFLICKR, ImageCLEF 2011, PASCAL VOC 2007 and 15-Scenes, even without using complex combinations of different local descriptors. It also complements well the cutting-edge Fisher Vector descriptors, showing even better results when employed in combination with them. Bossallova also shows good results in the challenging real-world application of pornography detection. (C) 2012 Elsevier Inc. All rights reserved.

Pooling in image representation: The visual codeword point of view

期刊

COMPUTER VISION AND IMAGE UNDERSTANDING

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Pooling in image representation: The visual codeword point of view

期刊

COMPUTER VISION AND IMAGE UNDERSTANDING

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文