Multi-modal deep network for RGB-D segmentation of clothes

ELECTRONICS LETTERS (2020)

Journal

ELECTRONICS LETTERS

Volume 56, Issue 9, Pages 432-434

Publisher

WILEY

DOI: 10.1049/el.2019.4150

Keywords

image fusion; learning (artificial intelligence); image segmentation; image colour analysis; synthetic data; real-world data; multimodal deep network; RGB-D segmentation; clothes; deep learning; semantic segmentation; synthetic dataset; different clothing styles; semantic classes; data generation pipeline; depth images; ground-truth label maps; novel multimodal encoder-decoder convolutional network; depth modalities; multimodal features; trained fusion modules; multiscale atrous convolutions

Funding

[BRGRD24]

Abstract

In this Letter, the authors propose a deep-learning-based method to perform semantic segmentation of clothes from RGB-D images of people. First, they present a synthetic dataset containing more than 50,000 RGB-D samples of characters in different clothing styles, featuring various poses and environments, for a total of nine semantic classes. The proposed data generation pipeline allows for fast production of RGB images, depth images and ground-truth label maps. Secondly, a novel multi-modal encoder-decoder convolutional network is proposed which operates on the RGB and depth modalities. Multi-modal features are merged by trained fusion modules that apply multi-scale atrous convolutions in the fusion process. The method is numerically evaluated on synthetic data and visually assessed on real-world data. The experiments demonstrate the efficiency of the proposed model over existing methods.
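
The abstract does not specify the layout of the fusion modules, so the following is only a minimal sketch of the general idea of multi-scale atrous (dilated) fusion, not the authors' network. It assumes single-channel feature maps, 3x3 kernels, dilation rates of 1, 2 and 4, and element-wise summation as the merge step; the function names `atrous_conv2d` and `fuse_multiscale` are hypothetical, and the kernels are passed in by hand where a trained module would learn them.

```python
# Illustrative sketch only. Rates, kernel size, and summation-based merging
# are assumptions; the Letter's abstract does not give these details.

def atrous_conv2d(fmap, kernel, rate):
    """3x3 atrous (dilated) convolution with zero padding; output keeps the input size."""
    h, w = len(fmap), len(fmap[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for ky in range(3):
                for kx in range(3):
                    # Kernel taps are spaced `rate` pixels apart.
                    sy = y + (ky - 1) * rate
                    sx = x + (kx - 1) * rate
                    if 0 <= sy < h and 0 <= sx < w:
                        acc += kernel[ky][kx] * fmap[sy][sx]
            out[y][x] = acc
    return out


def fuse_multiscale(rgb_feat, depth_feat, rgb_kernels, depth_kernels, rates=(1, 2, 4)):
    """Merge RGB and depth feature maps by summing atrous responses over several rates."""
    h, w = len(rgb_feat), len(rgb_feat[0])
    fused = [[0.0] * w for _ in range(h)]
    for rate, k_rgb, k_depth in zip(rates, rgb_kernels, depth_kernels):
        rgb_resp = atrous_conv2d(rgb_feat, k_rgb, rate)
        depth_resp = atrous_conv2d(depth_feat, k_depth, rate)
        for y in range(h):
            for x in range(w):
                fused[y][x] += rgb_resp[y][x] + depth_resp[y][x]
    return fused
```

Larger rates widen the receptive field without adding weights, which is the usual motivation for combining several rates when merging features from modalities with different spatial statistics.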
