4.6 Article

Adaptive Semi-Supervised Classifier Ensemble for High Dimensional Data Classification

Journal

IEEE TRANSACTIONS ON CYBERNETICS
Volume 49, Issue 2, Pages 366-379

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2017.2761908

Keywords

Classification; ensemble learning; feature selection; high dimensional data; optimization; semi-supervised learning

Funding

  1. NSFC [61722205, 61572199, 61572540, U1611461]
  2. Guangdong Natural Science Funds [S2013050014677, 2017A030312008]
  3. Science and Technology Planning Project of Guangdong Province, China [2015A050502011, 2016B090918042, 2016A050503015, 2016B010127003]
  4. Macau Science and Technology Development [019/2015/A, 024/2015/AMJ]
  5. Multiyear Research Grants from the University of Macau Multiyear Research Grants
  6. Research Grants Council of the Hong Kong Special Administrative Region, China [CityU 11300715]
  7. Hong Kong General Research Grant [152202/14E]
  8. PolyU Central Research Grant
  9. City University of Hong Kong [7004884]

Ask authors/readers for more resources

High dimensional data classification with very limited labeled training data is a challenging task in the area of data mining. In order to tackle this task, we first propose a feature selection-based semi-supervised classifier ensemble framework (FSCE) to perform high dimensional data classification. Then, we design an adaptive semi-supervised classifier ensemble framework (ASCE) to improve the performance of FSCE. When compared with FSCE, ASCE is characterized by an adaptive feature selection process, an adaptive weighting process (AWP), and an auxiliary training set generation process (ATSGP). The adaptive feature selection process generates a set of compact subspaces based on the selected attributes obtained by the feature selection algorithms, while the AWP associates each basic semi-supervised classifier in the ensemble with a weight value. The ATSGP enlarges the training set with unlabeled samples. In addition, a set of nonparametric tests are adopted to compare multiple semi-supervised classifier ensemble (SSCE) approaches over different datasets. The experiments on 20 high dimensional real-world datasets show that: 1) the two adaptive processes in ASCE are useful for improving the performance of the SSCE approach and 2) ASCE works well on high dimensional datasets with very limited labeled training data, and outperforms most state-of-the-art SSCE approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available