4.7 Article

A two-stage real-time YOLOv2-based road marking detector with lightweight spatial transformation-invariant classification

Journal

IMAGE AND VISION COMPUTING
Volume 102, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.imavis.2020.103978

Keywords

Deep learning; Road marking; Spatial transform; Real-time object detection; Object classification

Funding

  1. Ministry of Science and Technology (MOST), Taiwan ROC [108-2634-F-002-016, 108-2634-F-002-017, 108-2221-E-390-019-MY3]
  2. Center for AI AMP
  3. Advanced Robotics, National Taiwan University
  4. Joint Research Center for AI Technology under MOST
  5. All Vista Healthcare under MOST

Ask authors/readers for more resources

In recent years, Autonomous Driving Systems (ADS) become more and more popular and reliable. Road markings are important for drivers and advanced driver assistance systems by better understanding the road environment. While the detection of road markings may suffer a lot from various illuminations, weather conditions and angles of view, most traditional road marking detection methods use fixed threshold to detect road markings, which is not robust enough to handle various situations in the real world. To deal with this problem, some deep learning-based real-time detection frameworks such as Single Shot Detector (SSD) and You Only Look Once (YOLO) are suitable for this task. However, these deep learning-based methods are data-driven even while there is no public road marking dataset. Besides, these detection frameworks usually struggle with distorted road markings and balancing between the precision and recall. We propose a two-stage YOLOv2-based network to tackle distorted road marking detection as well as to balance precision and recall. The proposed spatial transformer layer is able to handle the distorted road markings in the second stage, so as to achieve the improvement of precision. Our network is able to run at 58 FPS in a single GTX 1070 under diverse circumstances. Furthermore, we present a dataset for the public use of road marking detection tasks, which consists of 11,800 high-resolution images captured under different weather conditions. Specifically, the images are manually annotated into 13 classes with bounding boxes. We empirically demonstrate both mean average precision ( mAP) and detection speed of our system over several baseline models. (C) 2020 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available