4.7 Article

LISU: Low-light indoor scene understanding with joint learning of reflectance restoration

Journal

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING
Volume 183, Issue -, Pages 470-481

Publisher

ELSEVIER
DOI: 10.1016/j.isprsjprs.2021.11.010

Keywords

Semantic segmentation; Deep learning; Intrinsic image decomposition; Low-light

Funding

  1. European Unions Horizon 2020 Research and Innovation Programme
  2. Korean Gov-ernment [833435]

Ask authors/readers for more resources

This paper presents a novel cascade network for studying semantic segmentation in low-light indoor environments, utilizing real and rendered images datasets. The proposed method achieves high accuracy in segmentation by decomposing low-light images and incorporating illumination invariant features. The results also demonstrate the importance of semantic information in enhancing reflectance restoration and segmentation accuracy.
Semantic segmentation using convolutional neural networks (CNNs) achieves higher accuracy than traditional methods, but it fails to yield satisfactory results under illumination variants when the training set is limited. In this paper we present a new data set containing both real and rendered images and a novel cascade network to study semantic segmentation in low-light indoor environments. Specifically, the network decomposes a low-light image into illumination and reflectance components, and then a multi-tasking learning scheme is built. One branch learns to reduce noise and restore information on the reflectance (reflectance restoration branch). Another branch learns to segment the reflectance map (semantic segmentation branch). The CNN features from two tasks are concatenated together so as to improve the segmentation accuracy by embedding the illumination invariant features. We compare our approach with other CNN-based segmentation frameworks, including the state-of-the-art DeepLab v3+, on the proposed real data set, and our approach achieves the highest mIoU (47.6%). The experimental results also show that the semantic information supports the restoration of a sharper reflectance map, thus further improving the segmentation. Besides, we pre-train a model with the proposed large-scale rendered images and then fine-tune it on the real images. The pre-training results in an improvement of mIoU by 7.2%. Our models and data set are publicly available for research. This research is part of the EU project INGENIOUS(1). Our data sets and models are available on our website(2).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available