4.6 Article

Multiobjective Semisupervised Classifier Ensemble

Journal

IEEE TRANSACTIONS ON CYBERNETICS
Volume 49, Issue 6, Pages 2280-2293

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2018.2824299

Keywords

Ensemble learning; feature selection; multiobjective optimization; semisupervised learning

Funding

  1. NSFC [61722205, 61751205, 61572199, 61572540, 61472145, U1611461]
  2. Guangdong Natural Science Funds [S2013050014677, 2017A030312008]
  3. Science and Technology Planning Project of Guangdong Province, China [2015A050502011, 2016B090918042, 2016A050503015, 2016B010127003]
  4. Guangzhou Science and Technology Planning Project [201704030051]
  5. Macau Science and Technology Development [019/2015/A, 024/2015/AMJ]
  6. Multiyear Research Grants through the University of Macau Multiyear Research
  7. Research Grants Council of the Hong Kong Special Administrative Region, China [CityU 11300715]
  8. Hong Kong General Research Grant [152202/14E]
  9. PolyU Central Research Grant under Grant G-YBJW
  10. City University of Hong Kong [7004884]

Ask authors/readers for more resources

Classification of high-dimensional data with very limited labels is a challenging task in the field of data mining and machine learning. In this paper, we propose the multiobjective semisupervised classifier ensemble (MOSSCE) approach to address this challenge. Specifically, a multiobjective subspace selection process (MOSSP) in MOSSCE is first designed to generate the optimal combination of feature subspaces. Three objective functions are then proposed for MOSSP, which include the relevance of features, the redundancy between features, and the data reconstruction error. Then, MOSSCE generates an auxiliary training set based on the sample confidence to improve the performance of the classifier ensemble. Finally, the training set, combined with the auxiliary training set, is used to select the optimal combination of basic classifiers in the ensemble, train the classifier ensemble, and generate the final result. In addition, diversity analysis of the ensemble learning process is applied, and a set of nonparametric statistical tests is adopted for the comparison of semisupervised classification approaches on multiple datasets. The experiments on 12 gene expression datasets and two large image datasets show that MOSSCE has a better performance than other state-of-the-art semisupervised classifiers on high-dimensional data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available