4.7 Article

Two-stage three-way enhanced technique for ensemble learning in inclusive policy text classification

期刊

INFORMATION SCIENCES
卷 547, 期 -, 页码 271-288

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2020.08.051

关键词

Three-way decisions; Decision support; Ensemble learning; Convolutional neural networks; Inclusive policy text classification

资金

  1. National Natural Science Foundation of China [71401026, 71432003, 61773352]
  2. Planning Fund for the Humanities and Social Sciences of Ministry of Education of China [19YJA630042]
  3. Double First-class Construction Research Support Project of UESTC [SYLYJ2019210]

向作者/读者索取更多资源

This study proposes a two-stage three-way enhanced technique to automatically classify policy text paragraphs into predefined categories. Experimental results show that the proposed method effectively supports the design of policy recommended platforms and serves SMEs.
With the development of the social economy, small and medium-sized enterprises (SMEs) play a vital role in promoting economic development. Multiple local governments in China are developing policy recommended platforms in order to help SMEs better understand the inclusive policy. However, these online platforms manually extract the key information from the inclusive policy texts, which takes a lot of time and causes low efficiency. The policy text is composed of some paragraphs and each paragraph corresponds to a topic. When we classify the paragraphs into different topics, there exists a decision risk of text misclassification. Therefore, we design two-stage based three-way enhanced technique to automatically classify these text paragraphs into the predefined categories. At the first stage, by using ensemble learning algorithms, we construct an ensemble convolution neural network (CNN) model in order to ensure the generalization ability and stability of text classification results. Meanwhile, we develop a new weight determination method to integrate the prediction results of all base classifiers according to the accuracy and classification confidence. With the help of three-way decisions (3WD), we assign the samples with poor resolution to the boundary area for secondary classification, which can reduce the decision risk. At the second stage, in order to classify the boundary region samples and improve the overall classification results, we further utilize traditional machine learning method as the secondary classifier. Finally, we develop some comparison experiments to verify our proposed method. The experimental results show that the two-stage three-way enhanced classification framework is valid and obtains a better performance. Our proposed method can effectively support the designment of policy recommended platforms and serve SMEs. (C) 2020 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据