4.7 Article

Manifold Adaptive Experimental Design for Text Categorization

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2011.104

关键词

Text categorization; active learning; experimental design; manifold learning; kernel method

资金

  1. National Natural Science Foundation of China [60905001, 90920303]
  2. National Basic Research Program of China (973 Program) [2011CB302206]
  3. Fundamental Research Funds for the Central Universities

向作者/读者索取更多资源

In many information processing tasks, labels are usually expensive and the unlabeled data points are abundant. To reduce the cost on collecting labels, it is crucial to predict which unlabeled examples are the most informative, i.e., improve the classifier the most if they were labeled. Many active learning techniques have been proposed for text categorization, such as SVMActive and Transductive Experimental Design. However, most of previous approaches try to discover the discriminant structure of the data space, whereas the geometrical structure is not well respected. In this paper, we propose a novel active learning algorithm which is performed in the data manifold adaptive kernel space. The manifold structure is incorporated into the kernel space by using graph Laplacian. This way, the manifold adaptive kernel space reflects the underlying geometry of the data. By minimizing the expected error with respect to the optimal classifier, we can select the most representative and discriminative data points for labeling. Experimental results on text categorization have demonstrated the effectiveness of our proposed approach.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据