4.7 Article

Machine learning for integrating data in biology and medicine: Principles, practice, and opportunities

期刊

INFORMATION FUSION
卷 50, 期 -, 页码 71-91

出版社

ELSEVIER
DOI: 10.1016/j.inffus.2018.09.012

关键词

Computational biology; Personalized medicine; Systems biology; Heterogeneous data; Machine learning

资金

  1. National Science Foundation [IIS-1149837]
  2. NIH BD2K [U54EB020405]
  3. DARPA SIMPLEX
  4. Natural Sciences and Natural Sciences and Engineering Research Council of Canada [RGPIN-2015-03948]
  5. Stanford Data Science Initiative
  6. Chan Zuckerberg Biohub

向作者/读者索取更多资源

New technologies have enabled the investigation of biology and human health at an unprecedented scale and in multiple dimensions. These dimensions include a myriad of properties describing genome, epigenome, transcriptome, microbiome, phenotype, and lifestyle. No single data type, however, can capture the complexity of all the factors relevant to understanding a phenomenon such as a disease. Integrative methods that combine data from multiple technologies have thus emerged as critical statistical and computational approaches. The key challenge in developing such approaches is the identification of effective models to provide a comprehensive and relevant systems view. An ideal method can answer a biological or medical question, identifying important features and predicting outcomes, by harnessing heterogeneous data across several dimensions of biological variation. In this Review, we describe the principles of data integration and discuss current methods and available implementations. We provide examples of successful data integration in biology and medicine. Finally, we discuss current challenges in biomedical integrative methods and our perspective on the future development of the field.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据