4.5 Article

A method of cross-layer fusion multi-object detection and recognition based on improved faster R-CNN model in complex traffic environment ?

Journal

PATTERN RECOGNITION LETTERS
Volume 145, Issue -, Pages 127-134

Publisher

ELSEVIER
DOI: 10.1016/j.patrec.2021.02.003

Keywords

Multi-object detection; Multi-object recognition; Faster R-CNN; Weighted balanced multi-class cross; entropy loss function

Funding

  1. National Natural Science Foundation of China [61701060]
  2. Doctoral Talent Training Project of Chongqing University of Posts and Telecommunications [BYJS202007]

Ask authors/readers for more resources

This study proposes a multi-object detection and recognition algorithm based on Faster R-CNN, which tackles the challenge of object detection in complex traffic environments through multiscale fusion and feature enhancement. Experimental results show that the algorithm performs better on the Cityscapes and KITTI datasets compared to current mainstream models.
Improving the detection accuracy and speed is the prerequisite of multi-object recognition in the complex traffic environment. Despite object detection has made significant advances based on deep neural networks, it remains a challenge to focus on small and occlusion objects. We address this challenge by allowing multiscale fusion. We introduce a cross-layer fusion multi-object detection and recognition algorithm based on Faster R-CNN, an approach that the five-layer structure of VGG16 (Visual Geometry Group) is used to obtain more characteristic information. We implement this idea with lateral embedding the 1 & times;1 convolution kernel, max pooling and deconvolution, in conjunction with weighted balanced multi-class cross entropy loss function and Soft-NMS to control the imbalance between difficult and easy samples. Considering the actual situation in a complex traffic environment, we manually label mixed dataset. On Cityscapes and KITTI datasets, experimental results show that the proposed model achieves better effects than the current mainstream object detection models. (c) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available