☆ 4.6 Article

Detecting Human Actions in Drone Images Using YoloV5 and Stochastic Gradient Boosting

SENSORS (2022)

期刊

SENSORS

卷 22, 期 18, 页码 -

出版社

MDPI

DOI: 10.3390/s22187020

关键词

action detection; YoloV5; gradient boosting classifier

类别

Chemistry, Analytical Engineering, Electrical & Electronic Instruments & Instrumentation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Human action recognition and detection from drones is a challenging task due to image acquisition and processing constraints. We propose a method that combines object recognition and classifier techniques for action identification. Our method outperforms previous approaches on the Okutama dataset, indicating its suitability for this specific dataset.

Human action recognition and detection from unmanned aerial vehicles (UAVs), or drones, has emerged as a popular technical challenge in recent years, since it is related to many use case scenarios from environmental monitoring to search and rescue. It faces a number of difficulties mainly due to image acquisition and contents, and processing constraints. Since drones' flying conditions constrain image acquisition, human subjects may appear in images at variable scales, orientations, and occlusion, which makes action recognition more difficult. We explore low-resource methods for ML (machine learning)-based action recognition using a previously collected real-world dataset (the Okutama-Action dataset). This dataset contains representative situations for action recognition, yet is controlled for image acquisition parameters such as camera angle or flight altitude. We investigate a combination of object recognition and classifier techniques to support single-image action identification. Our architecture integrates YoloV5 with a gradient boosting classifier; the rationale is to use a scalable and efficient object recognition system coupled with a classifier that is able to incorporate samples of variable difficulty. In an ablation study, we test different architectures of YoloV5 and evaluate the performance of our method on Okutama-Action dataset. Our approach outperformed previous architectures applied to the Okutama dataset, which differed by their object identification and classification pipeline: we hypothesize that this is a consequence of both YoloV5 performance and the overall adequacy of our pipeline to the specificities of the Okutama dataset in terms of bias-variance tradeoff.

Detecting Human Actions in Drone Images Using YoloV5 and Stochastic Gradient Boosting

期刊

SENSORS

出版社

MDPI

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Detecting Human Actions in Drone Images Using YoloV5 and Stochastic Gradient Boosting

期刊

SENSORS

出版社

MDPI

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文