4.7 Article

Representation learning with deep sparse auto-encoder for multi-task learning

期刊

PATTERN RECOGNITION
卷 129, 期 -, 页码 -

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2022.108742

关键词

Deep sparse auto-encoder; Multi-task learning; RICA; Labeled and unlabeled data

资金

  1. National Natural Science Foundation of China [6190 60 60, 91746209, 62076217]
  2. National Key Research and Development Program of China [2016YFB1000901]
  3. Yangzhou University Interdis-ciplinary Research Foundation for Animal Husbandry Discipline of Targeted Support [yzuxk202015]
  4. Opening Foundation of Key Laboratory of Huizhou Architecture in Anhui Province [HPJZ-2020-02]
  5. Open Project Program of Joint Interna-tional Research Laboratory of Agriculture and Agri-Product Safety [JILAR-KF202104]

向作者/读者索取更多资源

This paper introduces a multi-task learning framework based on DSML and SRICA models, which optimize with unlabeled data to achieve better performance.
We demonstrate an effective framework to achieve a better performance based on Deep Sparse auto encoder for Multi-task Learning, called DSML for short. To learn the reconstructed and higher-level features on cross-domain instances for multiple tasks, we combine the labeled and unlabeled data from all tasks to reconstruct the feature representations. Furthermore, we propose the model of Stacked Reconstruction Independence Component Analysis (SRICA for short) for the optimization of feature representations with a large amount of unlabeled data, which can effectively address the redundancy of image data. Our proposed SRICA model is developed from RICA and is based on deep sparse auto-encoder. In addition, we adopt a Semi-Supervised Learning method (SSL for short) based on model parameter regularization to build a unified model for multi-task learning. There are several advantages in our proposed framework as follows: 1) The proposed SRICA makes full use of a large amount of unlabeled data from all tasks. It is used to pursue an optimal sparsity feature representation, which can overcome the over fitting problem effectively. 2) The deep architecture used in our SRICA model is applied for higher-level and better representation learning, which is designed to train on patches for sphering the input data. 3) Training parameters in our proposed framework has lower computational cost compared to other common deep learning methods such as stacked denoising auto-encoders. Extensive experiments on several real image datasets demonstrate our proposed framework outperforms the state-of-the-art methods.(c) 2022 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据