4.6 Article

COTS: A Multipurpose RGB-D Dataset for Saliency and Image Manipulation Applications

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 21481-21497

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3055647

Keywords

Saliency detection; Object detection; Image segmentation; Lighting; Computer vision; Cameras; Semantics; Dataset; RGB-D; salient object detection; inpainting; blending; segmentation

Funding

  1. Malta Government Scholarships Scheme (MGSS)

Ask authors/readers for more resources

This paper introduces an RGB-D dataset designed for applications involving salient object detection, segmentation, inpainting, and blending techniques. The dataset fills a gap in the evaluation of image inpainting and blending applications, allowing for experiments to evaluate these different applications. Results demonstrate novel possibilities for the evaluation of computer vision applications.
Many modern computer vision systems include several modules that perform different processing operations packaged as a single pipeline architecture. This generally introduces a challenge in the evaluation process since most datasets provide evaluation data for just one of the operations. In this paper, we present an RGB-D dataset that was designed from first principles to cater for applications that involve salient object detection, segmentation, inpainting and blending techniques. This addresses a gap in the evaluation of image inpainting and blending applications that generally rely on subjective evaluation due to the lack of availability of comparative data. A set of experiments were carried out to demonstrate how the COTS dataset can be used to evaluate these different applications. This dataset includes a variety of scenes, where each scene is captured multiple times, each time adding a new object to the previous scene. This allows for a comparative analysis at pixel level in image inpainting and blending applications. Moreover, all objects were manually labeled in order to offer the possibility of salient object detection even in scenes that contain multiple objects. An online test with 1267 participants was also carried out, and this dataset also includes the click coordinates of users' selection for every image, introducing a user interaction dimension in the same RGB-D dataset. This dataset was also validated using state of the art techniques, obtaining an F-beta of 0.957 in salient object detection and a mean (Intersection over Union) IoU of 0.942 in Segmentation. Results demonstrate that the COTS dataset introduces novel possibilities for the evaluation of computer vision applications.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available