4.6 Article

Auto-encoder based structured dictionary learning for visual classification

Journal

NEUROCOMPUTING
Volume 438, Pages 34-43

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2020.09.088

Keywords

Structured dictionary learning; Image classification; Image set classification; Convolutional encoder based block sparse representation

Funding

  1. Overseas Study Fund of China Scholarship Council
  2. National Natural Science Foundation of China [U19A2073, U1804152, 62002096]
  3. Fundamental Research Funds for the Central Universities [JZ2020HGTB0050]


The paper introduces a deep Auto-Encoder based Structured Dictionary (AESD) learning model that learns a single dictionary composed of class-specific sub-dictionaries, with supervision provided by discriminative category constraints. Because the encoding layers share parameters that depend on the dictionary held by the decoding layer, training optimizes the dictionary alone, which keeps network training light-weight. In the testing phase, the method is extended into a Convolutional Encoder based Block Sparse Representation (CEBSR) model to improve image set based classification.
Dictionary learning and deep learning can be combined to boost the performance of classification tasks. However, existing combined methods often learn multi-level dictionaries, each embedded in a network layer. They therefore involve a large number of parameters (the elements of many dictionaries), which easily leads to prohibitive computational cost and even overfitting. In this paper, we present a novel deep Auto-Encoder based Structured Dictionary (AESD) learning model, in which only one dictionary, composed of class-specific sub-dictionaries, needs to be learned, and supervision is introduced by imposing discriminative category constraints that make the dictionary discriminative. The encoding layers are designed with shared parameters that depend entirely on the dictionary carried by the decoding layer. The learning process is thus a forward-propagation based optimization w.r.t. the dictionary only, leading to light-weight network training. The trained encoding network can be used directly, combined with a minimum-reconstruction-residual scheme, for single image based classification. To broaden the application spectrum of our method, in the testing phase we also extend the proposed prototype into a Convolutional Encoder based Block Sparse Representation (CEBSR) model that promotes the latent block sparsity in the joint representation of an image set, achieving improved image set based classification. Extensive experiments verify the performance of the learned dictionary for image classification, and the superiority of our extended model over state-of-the-art image set classification methods. (c) 2021 Elsevier B.V. All rights reserved.
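
The abstract does not give the layer equations, so the following is only a minimal sketch of the two ideas it names: an encoder whose layers share parameters derived from one dictionary, and classification by minimum reconstruction residual over class-specific sub-dictionaries. It assumes the encoder behaves like unrolled ISTA (a standard sparse-coding iteration, swapped in here for illustration and not confirmed as the authors' design). The names `encode`, `classify`, `lam`, `n_layers`, and `labels` are hypothetical, and dictionary training and the CEBSR extension are not shown.

```python
import numpy as np


def soft_threshold(v, tau):
    """Element-wise shrinkage, the proximal operator of the l1 norm."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)


def encode(x, D, lam=0.1, n_layers=50):
    """Sparse code of x under dictionary D via unrolled ISTA (assumed encoder).

    Every layer reuses the same W and S, and both are computed from D,
    so the dictionary is the only free parameter of the encoder.
    """
    step = np.linalg.norm(D, 2) ** 2            # Lipschitz constant of the gradient
    W = D.T / step                              # input weights, tied to D
    S = np.eye(D.shape[1]) - D.T @ D / step     # recurrent weights, tied to D
    z = soft_threshold(W @ x, lam / step)
    for _ in range(n_layers - 1):
        z = soft_threshold(W @ x + S @ z, lam / step)
    return z


def classify(x, D, labels, lam=0.1, n_layers=50):
    """Pick the class whose sub-dictionary reconstructs x with the smallest residual.

    labels[j] is the class of atom (column) j of D, so D[:, labels == c]
    is the class-specific sub-dictionary for class c.
    """
    z = encode(x, D, lam, n_layers)
    classes = np.unique(labels)
    residuals = [
        np.linalg.norm(x - D[:, labels == c] @ z[labels == c]) for c in classes
    ]
    return classes[int(np.argmin(residuals))]
```

As a usage illustration, with `D = np.eye(6)`, `labels = np.array([0, 0, 0, 1, 1, 1])`, and a test vector `x` equal to the fourth standard basis vector, `classify(x, D, labels)` selects class 1, because only the second sub-dictionary contains an atom that can reconstruct `x`.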
