4.5 Article

Large-scale machine learning based on functional networks for biomedical big data with high performance computing platforms

Journal

JOURNAL OF COMPUTATIONAL SCIENCE
Volume 11, Issue -, Pages 69-81

Publisher

ELSEVIER SCIENCE BV
DOI: 10.1016/j.jocs.2015.09.008

Keywords

Machine learning; Large scale high performance computing; Biomedical data and healthcare; Functional networks; Propensity score; Big data; MapReduce; Google Sibyl; Spark

Funding

  1. Sidra Medical and Research Center, Doha, Qatar
  2. IBM team

Ask authors/readers for more resources

Currently, the exponential growth of biomedical data along with the complexities of managing high dimensionality, imbalanced distribution, sparse attributes instigates a difficult challenge of effectively applying functional networks as a new large-scale predictive modeling in healthcare and biomedicine. This article proposes functional networks based on propensity score and Newton Raphson-maximum-likelihood optimizations as a new large-scale machine learning classifier to enhance its performance in addressing these challenges within big biomedical data. Different use-cases scenarios based on integrated phenotypic and genomics big biomedical data were proposed: real-life biomedical data, (i) optimal design of cancer chemotherapy; (ii) identify inpatient-admission of individuals with primary diagnosis of cancer; (iii) identify severe asthma exacerbation children using integrated phenotypic and SNP repository data; and (iv) mixture models simulation studies. Comparative studies were carried to compare the performance of the new paradigm versus the common state-of-the-art of machine learning, data mining, and statistics schemes. The results of performance of the new classifier with the most common classifiers on the four benchmark databases have been recorded in tables and graphs. The obtained results of the new classifier outperform most of existing state-of-the art statistical machine learning schemes with reliable and efficient performance. The new predictive modeling classifier is saving the computational time and having reliable performances along with future avenue for extension to deal with next generation sequencing data on high performance computing platforms. (C) 2015 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available