☆ 4.5 Article

JSE: Joint Semantic Encoder for zero-shot gesture learning

PATTERN ANALYSIS AND APPLICATIONS (2022)

Journal

PATTERN ANALYSIS AND APPLICATIONS

Volume 25, Issue 3, Pages 679-692

Publisher

SPRINGER

DOI: 10.1007/s10044-021-00992-y

Keywords

Zero-shot learning; Gesture recognition; Feature selection; Transfer learning

Funding

Agency for Healthcare Research and Quality (AHRQ)
National Institute of Health (NIH) [1R18HS024887-01]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study aims to investigate the impact of different feature extraction techniques on gesture recognition performance and proposes the Joint Semantic Encoder method. Regardless of attribute-based or cross-category conditions, JSE demonstrated a superior performance of 5% compared to other methods.

Zero-shot learning (ZSL) is a transfer learning paradigm that aims to recognize unseen categories just by having a high-level description of them. While deep learning has greatly pushed the limits of ZSL for object classification, ZSL for gesture recognition (ZSGL) remains largely unexplored. Previous attempts to address ZSGL were focused on the creation of gesture attributes and algorithmic improvements, and there is little or no research concerned with feature selection for ZSGL. It is indisputable that deep learning has obviated the need for feature engineering for problems with large datasets. However, when the data are scarce, it is critical to leverage the domain information to create discriminative input features. The main goal of this work is to study the effect of three different feature extraction techniques (velocity, heuristical and latent features) on the performance of ZSGL. In addition, we propose a bilinear auto-encoder approach, referred to as Joint Semantic Encoder (JSE), for ZSGL that jointly minimizes the reconstruction, semantic and classification losses. We conducted extensive experiments to compare and contrast the feature extraction techniques and to evaluate the performance of JSE with respect to existing ZSL methods. For attribute-based classification scenario, irrespective of the feature type, results showed that JSE outperforms other approaches by 5% (p<0.01). When JSE is trained with heuristical features in across-category condition, we showed that JSE significantly outperforms other methods by 5% (p<0.01)).

JSE: Joint Semantic Encoder for zero-shot gesture learning

Journal

PATTERN ANALYSIS AND APPLICATIONS

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

JSE: Joint Semantic Encoder for zero-shot gesture learning

Journal

PATTERN ANALYSIS AND APPLICATIONS

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper