4.5 Article

Large-scale machine learning based on functional networks for biomedical big data with high performance computing platforms

期刊

JOURNAL OF COMPUTATIONAL SCIENCE
卷 11, 期 -, 页码 69-81

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.jocs.2015.09.008

关键词

Machine learning; Large scale high performance computing; Biomedical data and healthcare; Functional networks; Propensity score; Big data; MapReduce; Google Sibyl; Spark

资金

  1. Sidra Medical and Research Center, Doha, Qatar
  2. IBM team

向作者/读者索取更多资源

Currently, the exponential growth of biomedical data along with the complexities of managing high dimensionality, imbalanced distribution, sparse attributes instigates a difficult challenge of effectively applying functional networks as a new large-scale predictive modeling in healthcare and biomedicine. This article proposes functional networks based on propensity score and Newton Raphson-maximum-likelihood optimizations as a new large-scale machine learning classifier to enhance its performance in addressing these challenges within big biomedical data. Different use-cases scenarios based on integrated phenotypic and genomics big biomedical data were proposed: real-life biomedical data, (i) optimal design of cancer chemotherapy; (ii) identify inpatient-admission of individuals with primary diagnosis of cancer; (iii) identify severe asthma exacerbation children using integrated phenotypic and SNP repository data; and (iv) mixture models simulation studies. Comparative studies were carried to compare the performance of the new paradigm versus the common state-of-the-art of machine learning, data mining, and statistics schemes. The results of performance of the new classifier with the most common classifiers on the four benchmark databases have been recorded in tables and graphs. The obtained results of the new classifier outperform most of existing state-of-the art statistical machine learning schemes with reliable and efficient performance. The new predictive modeling classifier is saving the computational time and having reliable performances along with future avenue for extension to deal with next generation sequencing data on high performance computing platforms. (C) 2015 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据