4.7 Article

Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model

出版社

ELSEVIER
DOI: 10.1016/j.future.2021.04.019

关键词

Semantic segmentation; Object recognition; Multiscale target; Multi-task; Faster-RCNN

资金

  1. National Natural Science Foundation of China [52075530, 51575407, 51505349, 61733011, 41906177]
  2. Hubei Provincial Department of Education, China [D20191105]
  3. National Defense PreResearch Foundation of Wuhan University of Science and Technology, China [GF201705]
  4. Open Fund of the Key Laboratory for Metallurgical Equipment and Control of Ministry of Education in Wuhan University of Science and Technology, China [2018B07, MECOF2019B13]

向作者/读者索取更多资源

This paper proposes a multi-task semantic segmentation model in complex indoor environments using the improved Faster-RCNN algorithm, addressing issues of uneven lighting by enhancing fusion methods and algorithms. The model achieves high performance and efficiency in segmenting contours of different scale objects.
Image semantic segmentation has received great attention in computer vision, whose aim is to segment different objects and provide them different semantic category labels so that the computer can fully obtain the semantic information of the scene. However, the current research mainly focuses on color image data as training, for outdoor scenes and single task semantic segmentation. This paper carries out multi-task semantic segmentation model in the complex indoor environment on joint target detection using RGB-D image information based on the improved Faster-RCNN algorithm, which can simultaneously realize the indoor scene semantic segmentation, target classification and detection multiple visual tasks. In which, in view of the influence of uneven lighting in the environment, the method of fusion of RGB images and depth images is improved. While enhancing the fusion image feature information, it also improves the efficiency of model training. Simultaneously, in order to meet the needs for operating on multi-scale target objects, the non-maximum value suppression algorithm is improved to improve the model's performance. So as to realize the output of the model's multi-task information, the loss function has also been redesigned and optimized. The indoor scene semantic segmentation model constructed in this paper not only has good performance and high efficiency, but also can segment the contours of different scale objects clearly and adapt to the indoor uneven lighting environment. (C) 2021 Published by Elsevier B.V.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据