4.7 Article

Hybrid feature selection algorithm using symmetrical uncertainty and a harmony search algorithm

Journal

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE
Volume 47, Issue 6, Pages 1312-1329

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/00207721.2014.924600

Keywords

feature selection; filter; harmony search; microarray; wrapper

Funding

  1. Malaysia Ministry of Education research grant [FRGS/1/2012/SG05/UKM/03/3, UKM-TT07-FRGS0157-2010]

Ask authors/readers for more resources

Microarray technology can be used as an efficient diagnostic system to recognise diseases such as tumours or to discriminate between different types of cancers in normal tissues. This technology has received increasing attention from the bioinformatics community because of its potential in designing powerful decision-making tools for cancer diagnosis. However, the presence of thousands or tens of thousands of genes affects the predictive accuracy of this technology from the perspective of classification. Thus, a key issue in microarray data is identifying or selecting the smallest possible set of genes from the input data that can achieve good predictive accuracy for classification. In this work, we propose a two-stage selection algorithm for gene selection problems in microarray data-sets called the symmetrical uncertainty filter and harmony search algorithm wrapper (SU-HSA). Experimental results show that the SU-HSA is better than HSA in isolation for all data-sets in terms of the accuracy and achieves a lower number of genes on 6 out of 10 instances. Furthermore, the comparison with state-of-the-art methods shows that our proposed approach is able to obtain 5 (out of 10) new best results in terms of the number of selected genes and competitive results in terms of the classification accuracy.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available