☆ 4.7 Article

Semi-supervised classifier ensemble model for high-dimensional data

INFORMATION SCIENCES (2023)

期刊

INFORMATION SCIENCES

卷 643, 期 -, 页码 -

出版社

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2023.119203

关键词

Ensemble learning; Semi-supervised learning; Classification; High-dimensional data

类别

Computer Science, Information Systems

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

To tackle the challenging task of high-dimensional data classification with limited labeled samples, we propose two semi-supervised learning models, SSRS and its adaptive version, ASSRS. These models address the unique characteristics of high-dimensional data by selecting subspaces of sample and feature dimensions and reducing dimensions. By incorporating sample-labeling auxiliary algorithm, adaptive sample subspace algorithm, and adaptive weight voting rule, ASSRS outperforms SSRS in terms of performance. Experiments demonstrate that SSRS and ASSRS perform better than other competitive algorithms and accurately label samples in datasets with limited labeled samples.

To complete the challenging task of high-dimensional data classification with limited labeled samples, we propose two semi-supervised learning models, namely the random subspace classifier ensemble model (SSRS) and its adaptive version (ASSRS). Considering the unique characteristics of high-dimensional data, SSRS selects subspaces of sample and feature dimensions and then reduces the dimensions of each subspace. To improve SSRS performance further, we designed a sample-labeling auxiliary algorithm, adaptive sample subspace algorithm, and adaptive weight voting rule for ASSRS to increase the proportion of labeled samples, obtain a suitable sample subspace for each feature subspace, and acquire a relative optimal weight for each base classifier. Experiments revealed that the performances of SSRS and ASSRS were better than those of other competitive algorithms and that the performance of ASSRS was stronger than that of SSRS. Additionally, we can accurately label samples in datasets where the proportion of labeled samples is relatively low by using SSRS and ASSRS. Because analysts are facing large numbers of high -dimensional datasets with limited labels, it is important to make accurate predictions based on a limited proportion of labeled data.

Semi-supervised classifier ensemble model for high-dimensional data

期刊

INFORMATION SCIENCES

出版社

ELSEVIER SCIENCE INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Semi-supervised classifier ensemble model for high-dimensional data

期刊

INFORMATION SCIENCES

出版社

ELSEVIER SCIENCE INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文