4.7 Article

ISAIR: Deep inpainted semantic aware image representation for background subtraction

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 207, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2022.117947

Keywords

Image representation; Background subtraction; Deep learning

Ask authors/readers for more resources

Nowadays, deep learning has significantly impacted computer vision applications by extracting essential features to solve specific problems. However, one drawback is the need for annotated training data. To address this, this paper proposes an image representation technique based on object detection and inpainting, which combines the generalization ability of traditional background subtraction techniques with the rich feature representation of deep learning. Experimental results show a 20% improvement in accuracy when using a conventional monocular camera.
Nowadays, deep learning is impacting computer vision applications significantly by learning to extract and describe the essential features that can assist in solving a specific problem. One of the main drawbacks of deep learning-based approaches is their requirement of annotated training data. For instance, in the area of background subtraction, a sequence of annotated training data that demonstrates the foreground and background of the scene is required; such a sequence is required from different scenarios the module performs. Hence, such methods cannot perform directly in new environments, and they expect training data from the new scene. In order to benefit from the high generalization ability and easy-to-use capability of traditional background subtraction techniques, and rich feature representation of deep learning approaches, this paper utilized deep learning based techniques and investigated the best image representation that can assist conventional background subtraction approaches. The proposed image representation is based on the detection of objects which are semantically moving and inpainting the parts of the images related to such objects. Experiments and results show that the proposed image representation can have the online and portable capability of traditional techniques and improve their accuracy by approximately 20% when a conventional type of monocular camera is used.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available