4.5 Article

Stacked Convolutional Sparse Auto-Encoders for Representation Learning

出版社

ASSOC COMPUTING MACHINERY
DOI: 10.1145/3434767

关键词

Sparse auto-encoder; representation learning

资金

  1. National Key Research and Development Program of China [2016YFB1000901]
  2. Natural Science Foundation of China [61906060, 91746209]
  3. Program for Changjiang Scholars and Innovative Research Team in University (PCSIRT) of the Ministry of Education [IRT17R32]

向作者/读者索取更多资源

The research proposes a semi-supervised deep learning framework to address the issue of insufficient labeled image data, utilizing stacked layers, convolutional approach, and sparse auto-encoder to learn feature representations. The framework also includes an algorithm to handle data redundancy and encodes label information using a Softmax regression model.
Deep learning seeks to achieve excellent performance for representation learning in image datasets. However, supervised deep learning models such as convolutional neural networks require a large number of labeled image data, which is intractable in applications, while unsupervised deep learning models like stacked denoising auto-encoder cannot employ label information. Meanwhile, the redundancy of image data incurs performance degradation on representation learning for aforementioned models. To address these problems, we propose a semi-supervised deep learning framework called stacked convolutional sparse auto-encoder, which can learn robust and sparse representations from image data with fewer labeled data records. More specifically, the framework is constructed by stacking layers. In each layer, higher layer feature representations are generated by features of lower layers in a convolutional way with kernels learned by a sparse auto-encoder. Meanwhile, to solve the data redundance problem, the algorithm of Reconstruction Independent Component Analysis is designed to train on patches for sphering the input data. The label information is encoded using a Softmax Regression model for semi-supervised learning. With this framework, higher level representations are learned by layers mapping from image data. It can boost the performance of the base subsequent classifiers such as support vector machines. Extensive experiments demonstrate the superior classification performance of our framework compared to several state-of-the-art representation learning methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据