☆ 4.6 Article

Using the Forest to See the Trees: Exploiting Context for Visual Object Detection and Localization

COMMUNICATIONS OF THE ACM (2010)

Journal

COMMUNICATIONS OF THE ACM

Volume 53, Issue 3, Pages 107-114

Publisher

ASSOC COMPUTING MACHINERY

DOI: 10.1145/1666420.1666446

Categories

Computer Science, Hardware & Architecture; Computer Science, Software Engineering; Computer Science, Theory & Methods

Funding

NGA [NEGI-1582-040004]
MURI [N00014-06-1-0734]
NSF [IIS 0747120, IIS-0413232]
National Defense Science and Engineering Graduate Fellowship
NSERC
CIFAR
Division of Information & Intelligent Systems; Directorate for Computer & Information Science & Engineering [0747120] (Funding Source: National Science Foundation)

Abstract

Recognizing objects in images is an active area of research in computer vision. In the last two decades, there has been much progress and there are already object recognition systems operating in commercial products. However, most of the algorithms for detecting objects perform an exhaustive search across all locations and scales in the image comparing local image regions with an object model. That approach ignores the semantic structure of scenes and tries to solve the recognition problem by brute force. In the real world, objects tend to covary with other objects, providing a rich collection of contextual associations. These contextual associations can be used to reduce the search space by looking only in places in which the object is expected to be; this also increases performance, by rejecting patterns that look like the target but appear in unlikely places. Most modeling attempts so far have defined the context of an object in terms of other previously recognized objects. The drawback of this approach is that inferring the context becomes as difficult as detecting each object. An alternative view of context relies on using the entire scene information holistically. This approach is algorithmically attractive since it dispenses with the need for a prior step of individual object recognition. In this paper, we use a probabilistic framework for encoding the relationships between context and object properties and we show how an integrated system provides improved performance. We view this as a significant step toward general purpose machine vision systems.
