4.7 Article

Cascade saccade machine learning network with hierarchical classes for traffic sign detection

期刊

SUSTAINABLE CITIES AND SOCIETY
卷 67, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.scs.2020.102700

关键词

Traffic sign detection; Attention mechanism; Hierarchical semantic tree; Class hierarchy; Autonomous driving technology

资金

  1. National Natural Science Foundation of China [61703054, U1864204]
  2. Key Research and Development Program of Shaanxi Province [2018ZDXMGY044]
  3. National Key R&D Program of China [2019YFB1600103]
  4. Fundamental Research Funds for the Central Universities [300102248202, 300102240501]

向作者/读者索取更多资源

Traffic signs detection is crucial for autonomous driving, yet challenges remain due to uneven distribution of training samples and appearance variations of signs. The proposed cascade saccade network in this paper shows promising results in detecting small size traffic signs and meets real-time driving safety application requirements.
Traffic signs detection is one of the significant tasks for autonomous driving. It conveys notable traffic information timely to road users and maintains traffic safety in smart grid of cities. However, the size of most traffic signs is less than 0.5% of the image of traffic scene, and the uneven distribution of training samples limits the accuracy of the model. Moreover, the appearance of traffic signs within the same category of meaning always varies in shapes, colors, from one country to another. Few works have provided robust solutions to these problems simultaneously. In this paper, motivated from the property of saccade in human vision, we developed a novel architecture cascade saccade network with class hierarchy structure for traffic sign detection and domain shift problem. Experiments on Chinese traffic sign benchmarks (TT100K) demonstrated that the proposed detector achieves comparable performance with the state-of-the-art methods with an average performance improvement of 6% in precision and 14% in recall for small size, and detection time with a GPU is 0.08 s per 2048 x 2048 sized image, which can satisfy the real time requirements of driving safety applications. Moreover, proposed model can be easily extended to solve the cross-domain detection of traffic signs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据