4.7 Article

Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model

Publisher

ELSEVIER
DOI: 10.1016/j.future.2021.04.019

Keywords

Semantic segmentation; Object recognition; Multiscale target; Multi-task; Faster-RCNN

Funding

  1. National Natural Science Foundation of China [52075530, 51575407, 51505349, 61733011, 41906177]
  2. Hubei Provincial Department of Education, China [D20191105]
  3. National Defense PreResearch Foundation of Wuhan University of Science and Technology, China [GF201705]
  4. Open Fund of the Key Laboratory for Metallurgical Equipment and Control of Ministry of Education in Wuhan University of Science and Technology, China [2018B07, MECOF2019B13]

Ask authors/readers for more resources

This paper proposes a multi-task semantic segmentation model in complex indoor environments using the improved Faster-RCNN algorithm, addressing issues of uneven lighting by enhancing fusion methods and algorithms. The model achieves high performance and efficiency in segmenting contours of different scale objects.
Image semantic segmentation has received great attention in computer vision, whose aim is to segment different objects and provide them different semantic category labels so that the computer can fully obtain the semantic information of the scene. However, the current research mainly focuses on color image data as training, for outdoor scenes and single task semantic segmentation. This paper carries out multi-task semantic segmentation model in the complex indoor environment on joint target detection using RGB-D image information based on the improved Faster-RCNN algorithm, which can simultaneously realize the indoor scene semantic segmentation, target classification and detection multiple visual tasks. In which, in view of the influence of uneven lighting in the environment, the method of fusion of RGB images and depth images is improved. While enhancing the fusion image feature information, it also improves the efficiency of model training. Simultaneously, in order to meet the needs for operating on multi-scale target objects, the non-maximum value suppression algorithm is improved to improve the model's performance. So as to realize the output of the model's multi-task information, the loss function has also been redesigned and optimized. The indoor scene semantic segmentation model constructed in this paper not only has good performance and high efficiency, but also can segment the contours of different scale objects clearly and adapt to the indoor uneven lighting environment. (C) 2021 Published by Elsevier B.V.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available