4.7 Article

Substructural Regularization With Data-Sensitive Granularity for Sequence Transfer Learning

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2016.2638321

关键词

Data-sensitive granularity; hidden Markov model (HMM); relative entropy (RE); sequence transfer learning; substructural regularization

资金

  1. Macau Science and Technology Development [019/2015/A]
  2. Multiyear Research Grants, University of Macau Multiyear Research Grants
  3. National Natural Science Foundation of China [61173035, 61472058, 61572540]
  4. Program for New Century Excellent Talents in University [NCET-11-0861]
  5. Liaoning Provincial Natural Science Foundation Guiding Program of China [201602195]
  6. Fundamental Research Funds for the Central Universities [DC201502030202]

向作者/读者索取更多资源

Sequence transfer learning is of interest in both academia and industry with the emergence of numerous new text domains from Twitter and other social media tools. In this paper, we put forward the data-sensitive granularity for transfer learning, and then, a novel substructural regularization transfer learning model (STLM) is proposed to preserve target domain features at substructural granularity in the light of the condition of labeled data set size. Our model is underpinned by hidden Markov model and regularization theory, where the substructural representation can be integrated as a penalty after measuring the dissimilarity of substructures between target domain and STLM with relative entropy. STLM can achieve the competing goals of preserving the target domain substructure and utilizing the observations from both the target and source domains simultaneously. The estimation of STLM is very efficient since an analytical solution can be derived as a necessary and sufficient condition. The relative usability of substructures to act as regularization parameters and the time complexity of STLM are also analyzed and discussed. Comprehensive experiments of part-of-speech tagging with both Brown and Twitter corpora fully justify that our model can make improvements on all the combinations of source and target domains.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据