4.5 Article

Why is real-world visual object recognition hard?

期刊

PLOS COMPUTATIONAL BIOLOGY
卷 4, 期 1, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.0040027

关键词

-

资金

  1. NATIONAL EYE INSTITUTE [R01EY014970] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Progress in understanding the brain mechanisms underlying vision requires the construction of computational models that not only emulate the brain's anatomy and physiology, but ultimately match its performance on visual tasks. In recent years, natural'' images have become popular in the study of vision and have been used to show apparently impressive progress in building such models. Here, we challenge the use of uncontrolled natural'' images in guiding that progress. In particular, we show that a simple V1-like model - a neuroscientist's null'' model, which should perform poorly at real-world visual object recognition tasks - outperforms state-of-the-art object recognition systems ( biologically inspired and otherwise) on a standard, ostensibly natural image recognition test. As a counterpoint, we designed a simpler'' recognition test to better span the real-world variation in object pose, position, and scale, and we show that this test correctly exposes the inadequacy of the V1-like model. Taken together, these results demonstrate that tests based on uncontrolled natural images can be seriously misleading, potentially guiding progress in the wrong direction. Instead, we reexamine what it means for images to be natural and argue for a renewed focus on the core problem of object recognition - real-world image variation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据