4.6 Article

A robust ensemble approach to learn from positive and unlabeled data using SVM base models

期刊

NEUROCOMPUTING
卷 160, 期 -, 页码 73-84

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2014.10.081

关键词

Classification; Semi-supervised learning; Ensemble learning; Support vector machine

资金

  1. Research Council KU Leuven [GOA/10/09 MaNet, CoE PFV/10/002, BIL12/11T]
  2. Flemish Government
  3. FWO [G.0871.12N, G.0377.12, G.088114N]
  4. IWT [100793, 100783, 130256, 100031, 111065]
  5. Industrial Research fund (IOF) [IOF/HB/13/027]
  6. iMinds Medical IT SBO
  7. VLK Stichting E. van der Schueren: rectal cancer
  8. Federal Government: FOD [KPC-29-023]
  9. Belgian Federal Science Policy Office [MAP P7/19]
  10. COST: Action [BM1104]
  11. EU
  12. European Research Council under European Union's Seventh Framework Programme/ERC [290923]

向作者/读者索取更多资源

We present a novel approach to learn binary classifiers when only positive and unlabeled instances are available (PU learning). This problem is routinely cast as a supervised task with label noise in the negative set. We use an ensemble of SVM models trained on bootstrap resamples of the training data for increased robustness against label noise. The approach can be considered in a bagging framework which provides an intuitive explanation for its mechanics in a semi-supervised setting. We compared our method to state-of-the-art approaches in simulations using multiple public benchmark data sets. The included benchmark comprises three settings with increasing label noise: (i) fully supervised, (ii) PU learning and (iii) PU learning with false positives. Our approach shows a marginal improvement over existing methods in the second setting and a significant improvement in the third. (C) 2015 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据