Article

Adapting Gaussian YOLOv3 with transfer learning for overhead view human detection in smart cities and societies

SUSTAINABLE CITIES AND SOCIETY (2021)

Journal

SUSTAINABLE CITIES AND SOCIETY

Volume 70

Publisher

ELSEVIER

DOI: 10.1016/j.scs.2021.102908

Keywords

Deep neural network; Smart cities and societies; Human detection; Overhead view; Transfer learning; Gaussian YOLOv3

Categories

Construction & Building Technology; Green & Sustainable Science & Technology; Energy & Fuels

Funding

King Saud University

Abstract

This paper introduces a deep neural network-based human detection system that uses an overhead perspective for intelligent surveillance in smart cities and societies. The system uses the Gaussian YOLOv3 algorithm, and the experimental results show an overall detection accuracy of 94%.

Deep neural networks are now widely applied in sustainable smart cities and societies, including smart manufacturing, healthcare, industry, agriculture, surveillance, and various artificial intelligence-based real-life applications. In this context, human detection has gained notable attention because it is a crucial task in intelligent surveillance applications. Researchers have applied a variety of computer vision and deep neural network-based techniques to human detection; however, they have often focused on the frontal-view camera perspective. In this work, we therefore introduce a human detection system for intelligent surveillance in smart cities and societies that uses a completely different perspective: an overhead view, which can provide sufficient visibility and coverage of a scene in congested and obstructed environments. However, detecting humans from such an extreme viewpoint can be difficult, as there are significant variations in human poses and appearances. We therefore use the Gaussian YOLOv3 algorithm, a deep neural network-based object detection technique, for human detection. The algorithm estimates bounding box uncertainty by modeling the box coordinates with Gaussian parameters, which improves accuracy and reduces false positives. Gaussian YOLOv3 is combined with channel attention and feature intertwine modules to improve specific feature maps. The channel attention module is applied to the feature map to learn each channel's weight autonomously, strengthen the key features, and enhance the network's ability to discriminate between humans and background. At the same time, different channels of the feature map are intertwined to obtain more representative features. Finally, the features obtained from the attention and feature intertwine modules are fused to form an improved feature map.
In addition, transfer learning is adopted to further increase the algorithm's detection accuracy for humans. The experimental results show that training improves the Gaussian YOLOv3 algorithm's potential for human detection, with an overall detection accuracy of 94%.
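
The idea of modeling box coordinates with Gaussian parameters can be illustrated with a minimal sketch. It is not the paper's implementation. It assumes that the network predicts a mean and a variance for each of the four box coordinates, that training uses a generic Gaussian negative log-likelihood, and that the predicted variances (assumed to be squashed into [0, 1]) are averaged to down-weight uncertain detections. The function names `gaussian_nll` and `detection_score`, and all numeric values, are hypothetical.

```python
import math


def gaussian_nll(target, mean, var, eps=1e-9):
    """Negative log-likelihood of `target` under N(mean, var) for one coordinate.

    Assumption: a generic Gaussian NLL; the exact loss in the paper may differ.
    """
    v = var + eps  # guard against a zero variance
    return 0.5 * math.log(2.0 * math.pi * v) + (target - mean) ** 2 / (2.0 * v)


def detection_score(objectness, class_prob, coord_vars):
    """Scale a detection's score by (1 - mean predicted coordinate variance).

    Assumption: each variance already lies in [0, 1], so a box whose
    coordinates are uncertain receives a lower final score.
    """
    uncertainty = sum(coord_vars) / len(coord_vars)
    return objectness * class_prob * (1.0 - uncertainty)


# Two hypothetical detections with the same objectness and class probability
# but different localization uncertainty.
confident = detection_score(0.9, 0.95, [0.05, 0.05, 0.10, 0.10])
uncertain = detection_score(0.9, 0.95, [0.60, 0.70, 0.80, 0.70])

# A prediction whose mean is far from the target is penalized more heavily.
loss_near = gaussian_nll(target=0.5, mean=0.5, var=0.01)
loss_far = gaussian_nll(target=0.5, mean=0.8, var=0.01)
```

Under these assumptions, the detection with larger predicted variances receives the lower score, which is one way localization uncertainty can help suppress false positives when detections are thresholded.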
