4.6 Article

A novel features ranking metric with application to scalable visual and bioinformatics data classification

Journal

NEUROCOMPUTING
Volume 173, Issue -, Pages 346-354

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2014.12.123

Keywords

Dimensionality reduction; Protein-protein interaction; Image classification; Feature ranking

Funding

  1. Natural Science Foundation of China [61370010, 61422210, 61373076]
  2. Natural Science Foundation of Fujian Province of China [2014J01253]
  3. Special Fund for Earthquake Research in the Public Interest [201508025]
  4. Major Program of The National Social Science Foundation of China [13ZD148]
  5. Fundamental Research Funds for the Central Universities [2013121026]
  6. opening Fund of Shanghai Key Laboratory of Intelligent Information Processing [IIPL-2014-004]

Ask authors/readers for more resources

Coming with the big data era, the filtering of uninformative data becomes emerging. To this end, ranking the high dimensionality features plays an important role. However, most of the state-of-art methods focus on improving the classification accuracy while the stability of the dimensionality reduction is simply ignored. In this paper, we proposed a Max-Relevance-Max-Distance (MRMD) feature ranking method, which balances accuracy and stability of feature ranking and prediction task. In order to prove the effectiveness on big data, we tested our method on two different datasets. The first one is image classification, which is a benchmark dataset with high dimensionality, while the second one is protein-protein interaction prediction data, which comes from our previous private research and has massive instances. Experiments prove that our method maintained the accuracy together with the stability on both two big datasets. Moreover, our method runs faster than other filtering and wrapping methods, such as mRMR and Information Gain. (C) 2015 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available