4.6 Article

Few-Shot Text Classification with Global-Local Feature Information

期刊

SENSORS
卷 22, 期 12, 页码 -

出版社

MDPI
DOI: 10.3390/s22124420

关键词

text classification; few-shot learning; news categorization; feature selection

资金

  1. Key-Area Research and Development Program of Guangdong Province [2019B010153002]
  2. Key Program of NSFC-Guangdong Joint Funds [U1801263]
  3. Science and Technology Projects of Guangzhou [202007040006]
  4. Program of Marine Economy Development (Six Marine Industries) Special Foundation of Department of Natural Resources of Guangdong Province [GDNRC [2020]056]
  5. Top Youth Talent Project of Zhujiang Talent Program [2019QN01X516]
  6. National Key RD project [2019YFB1705503]
  7. R& D projects in key areas of Guangdong Province [2018B010109007]
  8. Guangdong Provincial Key Laboratory of Cyber-Physical Systems [2020B1212060069]

向作者/读者索取更多资源

In this paper, the authors propose SumFS, a meta-learning framework for text classification that utilizes extractive summarization and improved local vocabulary features to achieve domain adaptation with limited labeled data. Experimental results show that SumFS can reduce input features while maintaining or improving accuracy, and significantly decrease training time.
Meta-learning frameworks have been proposed to generalize machine learning models for domain adaptation without sufficient label data in computer vision. However, text classification with meta-learning is less investigated. In this paper, we propose SumFS to find global top-ranked sentences by extractive summary and improve the local vocabulary category features. The SumFS consists of three modules: (1) an unsupervised text summarizer that removes redundant information; (2) a weighting generator that associates feature words with attention scores to weight the lexical representations of words; (3) a regular meta-learning framework that trains with limited labeled data using a ridge regression classifier. In addition, a marine news dataset was established with limited label data. The performance of the algorithm was tested on THUCnews, Fudan, and marine news datasets. Experiments show that the SumFS can maintain or even improve accuracy while reducing input features. Moreover, the training time of each epoch is reduced by more than 50%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据