4.7 Article

Adapting Gaussian YOLOv3 with transfer learning for overhead view human detection in smart cities and societies

期刊

SUSTAINABLE CITIES AND SOCIETY
卷 70, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.scs.2021.102908

关键词

Deep neural network; Smart cities and societies; Human detection; Overhead view; Transfer learning; Gaussian YOLOv3

资金

  1. King Saud University

向作者/读者索取更多资源

This paper introduces a human detection system based on deep neural networks using an overhead perspective for intelligent surveillance in smart cities and societies. By utilizing the Gaussian YOLOv3 algorithm to improve the human detection system, the experimental results show an overall detection accuracy of 94%.
Nowadays, deep neural networks are widely applied in sustainable smart cities and societies, including smart manufacturing, healthcare, industries, agriculture, surveillance, and various artificial intelligence-based real-life applications. In this regard, the human detection system has gained notable attention since it is recognized as a crucial task in intelligent surveillance applications. Researchers practiced a variety of computer vision and deep neural networks-based techniques for human detection-based applications; however, they often focused on the frontal view camera perspective. Thus, in this work, we have introduced a human detection system for intelligent surveillance in smart cities and societies with a completely distinct perspective, i.e., an overhead perspective that can provide sufficient visibility and coverage of a scene in congested and obstructed environments. However, human appearance can be difficult from such an extreme point of view, as there are significant variations in humans' poses and appearances. Therefore, in this work, leveraging the deep neural network-based object detection technique, the Gaussian YOLOv3 algorithm is used for human detection. The algorithm determines the bounding box uncertainty by modeling its coordinates as a Gaussian parameter, improving accuracy and reducing false positives. A Gaussian YOLOv3 is combined with channel attention and feature intertwine modules to improve specific feature maps. The channel attention module is combined with the feature map to learn each channel's weight autonomously, improve the key features, and enhance the network's ability to discriminate between humans and background. At the same time, different channels of the feature map are intertwined to obtain more representative features. Finally, the features obtained from the attention and feature intertwine modules are fused to form an improved feature map. In addition, to further increase the detection accuracy of the algorithm for human detection, transfer learning is adopted. The experimental outcomes reveal that training improves the Gaussian YOLOv3 algorithm's potential for human detection with an overall detection accuracy of 94%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据