4.6 Article

A Projection Pursuit framework for supervised dimension reduction of high dimensional small sample datasets

Journal

NEUROCOMPUTING
Volume 149, Issue -, Pages 767-776

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2014.07.057

Keywords

Projection Pursuit; Classification; Gene expression; Dimension reduction

Funding

  1. CNPq (Conselho Nacional de Desenvolvimento Cientifico e Tecnologico) [151547/2013-0]
  2. FAPESP (Sao Paulo Research Foundation) [2012/22295-0]

Ask authors/readers for more resources

The analysis and interpretation of datasets with large number of features and few examples has remained as a challenging problem in the scientific community, owing to the difficulties associated with the curse-of-the-dimensionality phenomenon. Projection Pursuit (PP) has shown promise in circumventing this phenomenon by searching low-dimensional projections of the data where meaningful structures are exposed. However, PP faces computational difficulties in dealing with datasets containing thousands of features (typical in genomics and proteomics) due to the vast quantity of parameters to optimize. In this paper we describe and evaluate a PP framework aimed at relieving such difficulties and thus ease the construction of classifier systems. The framework is a two-stage approach, where the first stage performs a rapid compaction of the data and the second stage implements the PP search using an improved version of the SPP method (Guo et al., 2000, 132]). In an experimental evaluation with eight public microarray datasets we showed that some configurations of the proposed framework can clearly overtake the performance of eight well-established dimension reduction methods in their ability to pack more discriminatory information into fewer dimensions. (C) 2014 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available