4.3 Article

A systematic comparison between visual cues for boundary detection

Journal

VISION RESEARCH
Volume 120, Issue -, Pages 93-107

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.visres.2015.11.007

Keywords

Boundary; Contour; Segmentation; Grouping; Early vision; Primary visual cortex; Natural scenes

Funding

  1. ONR grant [N000141110743]
  2. DARPA young faculty award [N66001-14-1-4037]
  3. NSF early career award [IIS-1252951]
  4. Center for Computation and Visualization (CCV) at Brown University

Ask authors/readers for more resources

The detection of object boundaries is a critical first step for many visual processing tasks. Multiple cues (we consider luminance, color, motion and binocular disparity) available in the early visual system may signal object boundaries but little is known about their relative diagnosticity and how to optimally combine them for boundary detection. This study thus aims at understanding how early visual processes inform boundary detection in natural scenes. We collected color binocular video sequences of natural scenes to construct a video database. Each scene was annotated with two full sets of ground-truth contours (one set limited to object boundaries and another set which included all edges). We implemented an integrated computational model of early vision that spans all considered cues, and then assessed their diagnosticity by training machine learning classifiers on individual channels. Color and luminance were found to be most diagnostic while stereo and motion were least. Combining all cues yielded a significant improvement in accuracy beyond that of any cue in isolation. Furthermore, the accuracy of individual cues was found to be a poor predictor of their unique contribution for the combination. This result suggested a complex interaction between cues, which we further quantified using regularization techniques. Our systematic assessment of the accuracy of early vision models for boundary detection together with the resulting annotated video dataset should provide a useful benchmark towards the development of higher-level models of visual processing. (C) 2016 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available