4.4 Article Retracted Publication

被撤回的出版物: Gesture recognition algorithm based on multi-scale feature fusion in RGB-D images (Retracted article. See vol. 17, pg. 301, 2023)

期刊

IET IMAGE PROCESSING
卷 14, 期 15, 页码 3662-3668

出版社

WILEY
DOI: 10.1049/iet-ipr.2020.0148

关键词

image colour analysis; gesture recognition; image sensors; convolutional neural nets; video signal processing; artificial intelligence; gesture recognition algorithm; multiscale feature fusion; RGB-D images; sensor technology; artificial intelligence; video gesture recognition technology; Big Data; on-board control; electronic games; robust recognition; illumination change; background clutter; partial occlusion; multilevel feature fusion; two-stream convolutional neural network; Kinect sensor; red-green-blue-depth images; gesture database; data enhancement; complex backgrounds

资金

  1. National Natural Science Foundation of China [51575407, 51505349, 61733011, 41906177]
  2. Hubei Provincial Department of Education [D20191105]
  3. National Defense Pre-Research Foundation of Wuhan University of Science and Technology [GF201705]
  4. Open Fund of the Key Laboratory for Metallurgical Equipment and Control of Ministry of Education in Wuhan University of Science and Technology [2018B07]

向作者/读者索取更多资源

With the rapid development of sensor technology and artificial intelligence, the video gesture recognition technology under the background of big data makes human-computer interaction more natural and flexible, bringing the richer interactive experience to teaching, on-board control, electronic games etc. To perform robust recognition under the conditions of illumination change, background clutter, rapid movement, and partial occlusion, an algorithm based on multi-level feature fusion of two-stream convolutional neural network is proposed, which includes three main steps. Firstly, the Kinect sensor obtains red-green-blue-depth (RGB-D) images to establish a gesture database. At the same time, data enhancement is performed on the training set and test set. Then, a model of multi-level feature fusion of a two-stream convolutional neural network is established and trained. Experiments show that the proposed network model can robustly track and recognise gestures under complex backgrounds (such as similar complexion, illumination changes, and occlusion), and compared with the single-channel model, the average detection accuracy is improved by 1.08%, and mean average precision is improved by 3.56%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据