☆ 4.7 Article

Learning descriptive visual representation for image classification and annotation

PATTERN RECOGNITION (2015)

期刊

PATTERN RECOGNITION

卷 48, 期 2, 页码 498-508

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.patcog.2014.08.008

关键词

Image classification; Image annotation; Visual representation; Matrix factorization

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

National Natural Science Foundation of China [61202231, 61222307]
National Key Basic Research Program (973 Program) of China [2014CB340403]
Beijing Natural Science Foundation of China [4132037]
Ph.D. Programs Foundation of Ministry of Education of China [20120001120130]
Fundamental Research Funds for the Central Universities
Research Funds of Renmin University of China [14XNLF04]
Microsoft Research Asia

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

This paper presents a novel semantic regularized matrix factorization method for learning descriptive visual bag-of-words (BOW) representation. Although very influential in image classification, the traditional visual BOW representation has one distinct drawback. That is, for efficiency purposes, this visual representation is often generated by directly clustering the low-level visual feature vectors extracted from local keypoints or regions, without considering the high-level semantics of images. In other words, it still suffers from the semantic gap and may lead to significant performance degradation in more challenging tasks, e.g., image classification over social collections with large intra-class variations. To learn descriptive visual BOW representation for such image classification task, we develop a semantic regularized matrix factorization method by adding Laplacian regularization defined with the tags (easy to access) of social images into matrix factorization. Moreover, given that image annotation only provides the tags of training images in advance (while the tags of all social images are available), we can readily apply the proposed method to image annotation by first running a round of image annotation to predict the tags (maybe incorrect) of test images and thus obtaining the tags of all images. Experimental results show the promising performance of the proposed method. (C) 2014 Elsevier Ltd. All rights reserved.

Learning descriptive visual representation for image classification and annotation

期刊

PATTERN RECOGNITION

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Learning descriptive visual representation for image classification and annotation

期刊

PATTERN RECOGNITION

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文