4.4 Article Retracted Publication

被撤回的出版物: Gesture recognition algorithm based on multi-scale feature fusion in RGB-D images (Retracted article. See vol. 17, pg. 301, 2023)

Journal

IET IMAGE PROCESSING
Volume 14, Issue 15, Pages 3662-3668

Publisher

WILEY
DOI: 10.1049/iet-ipr.2020.0148

Keywords

image colour analysis; gesture recognition; image sensors; convolutional neural nets; video signal processing; artificial intelligence; gesture recognition algorithm; multiscale feature fusion; RGB-D images; sensor technology; artificial intelligence; video gesture recognition technology; Big Data; on-board control; electronic games; robust recognition; illumination change; background clutter; partial occlusion; multilevel feature fusion; two-stream convolutional neural network; Kinect sensor; red-green-blue-depth images; gesture database; data enhancement; complex backgrounds

Funding

  1. National Natural Science Foundation of China [51575407, 51505349, 61733011, 41906177]
  2. Hubei Provincial Department of Education [D20191105]
  3. National Defense Pre-Research Foundation of Wuhan University of Science and Technology [GF201705]
  4. Open Fund of the Key Laboratory for Metallurgical Equipment and Control of Ministry of Education in Wuhan University of Science and Technology [2018B07]

Ask authors/readers for more resources

With the rapid development of sensor technology and artificial intelligence, the video gesture recognition technology under the background of big data makes human-computer interaction more natural and flexible, bringing the richer interactive experience to teaching, on-board control, electronic games etc. To perform robust recognition under the conditions of illumination change, background clutter, rapid movement, and partial occlusion, an algorithm based on multi-level feature fusion of two-stream convolutional neural network is proposed, which includes three main steps. Firstly, the Kinect sensor obtains red-green-blue-depth (RGB-D) images to establish a gesture database. At the same time, data enhancement is performed on the training set and test set. Then, a model of multi-level feature fusion of a two-stream convolutional neural network is established and trained. Experiments show that the proposed network model can robustly track and recognise gestures under complex backgrounds (such as similar complexion, illumination changes, and occlusion), and compared with the single-channel model, the average detection accuracy is improved by 1.08%, and mean average precision is improved by 3.56%.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available