4.6 Article

YOLO MDE: Object Detection with Monocular Depth Estimation

Journal

ELECTRONICS
Volume 11, Issue 1, Pages -

Publisher

MDPI
DOI: 10.3390/electronics11010076

Keywords

object detection; depth estimation; deep learning

Funding

  1. BK21 FOUR (Fostering Outstanding Universities for Research) [5199991614564]

Ask authors/readers for more resources

This study introduces an object detector with depth estimation using monocular camera images, focusing on predicting a single depth per object for improved risk assessment in autonomous driving. By adding an additional depth estimation channel to the YOLO v4 network architecture, training on dataset labels and benchmarking against existing models, it shows higher detection performance and speed with comparable depth accuracy.
This paper presents an object detector with depth estimation using monocular camera images. Previous detection studies have typically focused on detecting objects with 2D or 3D bounding boxes. A 3D bounding box consists of the center point, its size parameters, and heading information. However, predicting complex output compositions leads a model to have generally low performances, and it is not necessary for risk assessment for autonomous driving. We focused on predicting a single depth per object, which is essential for risk assessment for autonomous driving. Our network architecture is based on YOLO v4, which is a fast and accurate one-stage object detector. We added an additional channel to the output layer for depth estimation. To train depth prediction, we extract the closest depth from the 3D bounding box coordinates of ground truth labels in the dataset. Our model is compared with the latest studies on 3D object detection using the KITTI object detection benchmark. As a result, we show that our model achieves higher detection performance and detection speed than existing models with comparable depth accuracy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available