Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study

KNOWLEDGE AND INFORMATION SYSTEMS (2015)

Journal

KNOWLEDGE AND INFORMATION SYSTEMS

Volume 42, Issue 2, Pages 245-284

Publisher

SPRINGER LONDON LTD

DOI: 10.1007/s10115-013-0706-y

Keywords

Learning from unlabeled data; Semi-supervised learning; Self-training; Co-training; Multi-view learning; Classification

Categories

Computer Science, Artificial Intelligence; Computer Science, Information Systems

Funding

[TIN2011-28488]
[TIC-6858]
[P11-TIC-7765]

Abstract

Semi-supervised classification methods are suitable tools to tackle training sets with large amounts of unlabeled data and a small quantity of labeled data. This problem has been addressed by several approaches with different assumptions about the characteristics of the input data. Among them, self-labeled techniques follow an iterative procedure, aiming to obtain an enlarged labeled data set, in which they accept that their own predictions tend to be correct. In this paper, we provide a survey of self-labeled methods for semi-supervised classification. From a theoretical point of view, we propose a taxonomy based on the main characteristics presented in them. Empirically, we conduct an exhaustive study that involves a large number of data sets, with different ratios of labeled data, aiming to measure their performance in terms of transductive and inductive classification capabilities. The results are contrasted with nonparametric statistical tests. Note is then taken of which self-labeled models are the best-performing ones. Moreover, a semi-supervised learning module has been developed for the Knowledge Extraction based on Evolutionary Learning software, integrating analyzed methods and data sets.
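The iterative procedure the abstract describes can be illustrated with a minimal self-training sketch. This is not any specific method from the survey and does not use the KEEL module: the nearest-centroid base classifier, the distance-based confidence score, the `threshold` and `max_iter` parameters, and all function names are assumptions chosen to keep the example short and dependency-free.

```python
import math


def centroids(X, y):
    """Mean point of each class in the labeled set."""
    sums, counts = {}, {}
    for x, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [v / counts[c] for v in acc] for c, acc in sums.items()}


def predict_with_confidence(cents, x):
    """Return (label, confidence); confidence is a softmax over negative distances."""
    dists = {c: math.dist(x, m) for c, m in cents.items()}
    d_min = min(dists.values())  # shift distances for numerical stability
    weights = {c: math.exp(-(d - d_min)) for c, d in dists.items()}
    label = max(weights, key=weights.get)
    return label, weights[label] / sum(weights.values())


def self_train(X_lab, y_lab, X_unlab, threshold=0.8, max_iter=10):
    """Hypothetical self-training loop: enlarge the labeled set with the
    classifier's own confident predictions until nothing more is accepted."""
    X_lab, y_lab, pool = list(X_lab), list(y_lab), list(X_unlab)
    for _ in range(max_iter):
        if not pool:
            break
        cents = centroids(X_lab, y_lab)
        scored = [(x, *predict_with_confidence(cents, x)) for x in pool]
        accepted = [(x, lab) for x, lab, conf in scored if conf >= threshold]
        if not accepted:
            break  # no prediction is confident enough; stop early
        for x, lab in accepted:
            X_lab.append(x)
            y_lab.append(lab)
        pool = [x for x, _, conf in scored if conf < threshold]
    return centroids(X_lab, y_lab), X_lab, y_lab, pool


# Two labeled points and five unlabeled ones (toy data, assumed for illustration).
X_lab = [(0.0, 0.0), (10.0, 10.0)]
y_lab = ["a", "b"]
X_unlab = [(1.0, 0.0), (0.0, 1.0), (9.0, 10.0), (10.0, 9.0), (5.0, 5.0)]

model, X_big, y_big, leftover = self_train(X_lab, y_lab, X_unlab)
```

In this sketch, the labels assigned to points drawn from `X_unlab` correspond to the transductive use of the method, while calling `predict_with_confidence(model, x)` on a point that was never in the training pool corresponds to the inductive use. The point `(5.0, 5.0)` is equidistant from both classes, so it never passes the confidence threshold and stays unlabeled.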
