☆ 4.7 Article

Swin transformer based vehicle detection in undisciplined traffic environment

EXPERT SYSTEMS WITH APPLICATIONS (2023)

期刊

EXPERT SYSTEMS WITH APPLICATIONS

卷 213, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.eswa.2022.118992

关键词

Deep learning; Undisciplined traffic environment; Visual transformer; Vehicle detection

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Operations Research & Management Science

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Intelligent vehicle detection (IVD) plays a crucial role in intelligent traffic management systems, especially in undisciplined traffic environments. This paper proposes a swin transformer-based vehicle detection (STVD) framework that effectively addresses the multi-scale feature extraction problem and achieves high detection accuracy on various datasets.

Intelligent vehicle detection (IVD) plays a prominent role in evolving an intelligent traffic management system (ITMS). It can help to decrease the average waiting time at the traffic post, save fuel consumption, control traffic congestion, decrease accident rates, and build up human safety. Recent developments in the artificial intelligence (AI) domain have increased the demand for IVD in the undisciplined traffic environment, which is a usual condition in developing countries. IVD is a difficult task in an undisciplined traffic environment because different vehicle categories travel very close to each other on the roads and do not follow traffic rules. Previously, several convolutional neural network (CNN) based deep learning (DL), and visual transformer-based techniques for vehicle and object detection have been presented. They are complex and do not accurately extract multi-scale features due to the involvement of existing CNN feature extraction backbones. Also, most techniques failed to account for an undisciplined traffic environment due to the unavailability of labeled vehicle datasets. Therefore, this paper proposes a swin transformer-based vehicle detection (STVD) framework in an undisciplined traffic environment. Swin transformer (ST) wholly exchanges information within and between image patches and provides hierarchical feature maps, effectively alleviating the multi-scale feature extraction problem. A bi-directional feature pyramid network (BIFPN) is presented, which combines low -resolution features with high-resolution features in a bidirectional way and provides robust multi-scale features with different scales and resolutions. A fully connected vehicle detection head (FCVDH) is applied to improve the matching relationship between vehicle sizes and the BIFPN hierarchy. FCVDH predicts the locations and categories of vehicles in the input image. STVD is analyzed, experimented, and measured over realistic traffic data. Also, it is compared with the existing state-of-the-art vehicle detection methods. It achieves 91.32% detection accuracy on diverse traffic labeled dataset (DTLD), 87.4% on IITM-hetra, and 88.45% on KITTI datasets.

Swin transformer based vehicle detection in undisciplined traffic environment

期刊

EXPERT SYSTEMS WITH APPLICATIONS

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Swin transformer based vehicle detection in undisciplined traffic environment

期刊

EXPERT SYSTEMS WITH APPLICATIONS

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文