4.6 Article

Feature learning based on SAE-PCA network for human gesture recognition in RGBD images

Journal

NEUROCOMPUTING
Volume 151, Issue -, Pages 565-573

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2014.06.086

Keywords

Deep learning; Auto-encoder; Convolutional neural networks; American sign language recognition

Funding

  1. Nature Science Foundation of China [61422210, 61373076, 61202143]
  2. Fundamental Research Funds for the Central Universities [2013121026, 2011121052]
  3. 985 Project of Xiamen University
  4. Natural Science Foundation of Fujian Province [2013J05100, 2010J01345, 2011J01367]
  5. Key Projects Fund of Science and Technology in Xiamen [3502Z20123017]
  6. Research Fund for the Doctoral Program of Higher Education of China [201101211120024]
  7. Special Fund for Developing Shenzhen's Strategic Emerging Industries [JCYJ20120614164600201]
  8. Hunan Provincial Natural Science Foundation [12JJ2040]
  9. Hunan Province Research Foundation of Education Committee [09A046]

Ask authors/readers for more resources

Coming with the emerging of depth sensors link Microsoft Kinect, human hand gesture recognition has received ever increasing research interests recently. A successful gesture recognition system has usually heavily relied on having a good feature representation of data, which is expected to be task-dependent as well as coping with the challenges and opportunities induced by depth sensor. In this paper, a feature learning approach based on sparse auto-encoder (SAE) and principle component analysis is proposed for recognizing human actions, i.e. finger-spelling or sign language, for RGB-D inputs. The proposed model of feature learning is consisted of two components: First, features are learned respectively from the RGB and depth channels, using sparse auto-encoder with convolutional neural networks. Second, the learned features from both channels is concatenated and fed into a multiple layer PCA to get the final feature. Experimental results on American sign language (ASL) dataset demonstrate that the proposed feature learning model is significantly effective, which improves the recognition rate from 75% to 99.05% and outperforms the state-of-the-art. (C) 2014 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available