☆ 4.7 Article

Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2021)

期刊

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE

卷 123, 期 -, 页码 94-104

出版社

ELSEVIER

DOI: 10.1016/j.future.2021.04.019

关键词

Semantic segmentation; Object recognition; Multiscale target; Multi-task; Faster-RCNN

类别

Computer Science, Theory & Methods

资金

National Natural Science Foundation of China [52075530, 51575407, 51505349, 61733011, 41906177]
Hubei Provincial Department of Education, China [D20191105]
National Defense PreResearch Foundation of Wuhan University of Science and Technology, China [GF201705]
Open Fund of the Key Laboratory for Metallurgical Equipment and Control of Ministry of Education in Wuhan University of Science and Technology, China [2018B07, MECOF2019B13]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a multi-task semantic segmentation model in complex indoor environments using the improved Faster-RCNN algorithm, addressing issues of uneven lighting by enhancing fusion methods and algorithms. The model achieves high performance and efficiency in segmenting contours of different scale objects.

Image semantic segmentation has received great attention in computer vision, whose aim is to segment different objects and provide them different semantic category labels so that the computer can fully obtain the semantic information of the scene. However, the current research mainly focuses on color image data as training, for outdoor scenes and single task semantic segmentation. This paper carries out multi-task semantic segmentation model in the complex indoor environment on joint target detection using RGB-D image information based on the improved Faster-RCNN algorithm, which can simultaneously realize the indoor scene semantic segmentation, target classification and detection multiple visual tasks. In which, in view of the influence of uneven lighting in the environment, the method of fusion of RGB images and depth images is improved. While enhancing the fusion image feature information, it also improves the efficiency of model training. Simultaneously, in order to meet the needs for operating on multi-scale target objects, the non-maximum value suppression algorithm is improved to improve the model's performance. So as to realize the output of the model's multi-task information, the loss function has also been redesigned and optimized. The indoor scene semantic segmentation model constructed in this paper not only has good performance and high efficiency, but also can segment the contours of different scale objects clearly and adapt to the indoor uneven lighting environment. (C) 2021 Published by Elsevier B.V.

Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model

期刊

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model

期刊

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文