4.7 Article

A Fast SVD-Hidden-nodes based Extreme Learning Machine for Large-Scale Data Analytics

Journal

NEURAL NETWORKS
Volume 77, Issue -, Pages 14-28

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.neunet.2015.09.003

Keywords

Extreme Learning Machine; Singular value decomposition; Big data; Big dimensional data; Fast approximation method

Funding

  1. ASTAR Thematic Strategic Research Programme (TSRP) Grant [1121720013]
  2. Computational Intelligence Research Laboratory at NTU
  3. National Science Foundation of China [61572399, 61532015, 61373116]
  4. Shaanxi New Star of Science Technology [2013KJXX-29]
  5. New Star Team of Xian University of Posts & Telecommunications
  6. Provincial Key Disciplines Construction Fund of General Institutions of Higher Education in Shaanxi

Ask authors/readers for more resources

Big dimensional data is a growing trend that is emerging in many real world contexts, extending from web mining, gene expression analysis, protein-protein interaction to high-frequency financial data. Nowadays, there is a growing consensus that the increasing dimensionality poses impeding effects on the performances of classifiers, which is termed as the peaking phenomenon'' in the field of machine intelligence. To address the issue, dimensionality reduction is commonly employed as a preprocessing step on the Big dimensional data before building the classifiers. In this paper, we propose an Extreme Learning Machine (ELM) approach for large-scale data analytic. In contrast to existing approaches, we embed hidden nodes that are designed using singular value decomposition (SVD) into the classical ELM. These SVD nodes in the hidden layer are shown to capture the underlying characteristics of the Big dimensional data well, exhibiting excellent generalization performances. The drawback of using SVD on the entire dataset, however, is the high computational complexity involved. To address this, a fast divide and conquer approximation scheme is introduced to maintain computational tractability on high volume data. The resultant algorithm proposed is labeled here as Fast Singular Value Decomposition-Hidden-nodes based Extreme Learning Machine or FSVD-H-ELM in short. In FSVD-H-ELM, instead of identifying the SVD hidden nodes directly from the entire dataset, SVD hidden nodes are derived from multiple random subsets of data sampled from the original dataset. Comprehensive experiments and comparisons are conducted to assess the FSVD-H-ELM against other state-of-the-art algorithms. The results obtained demonstrated the superior generalization performance and efficiency of the FSVD-H-ELM. (C) 2016 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available