4.7 Article

Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data

期刊

SCIENTIFIC REPORTS
卷 4, 期 -, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/srep04411

关键词

-

资金

  1. National Center for Research Resources, National Institutes of Health [1UL1 RR029893]
  2. Cancer Research UK
  3. British Columbia Cancer Agency Branch
  4. Seventh Framework Programme [EU-FP7-ICT-2007-2-22483-NeoMark]

向作者/读者索取更多资源

The spectrum of modern molecular high-throughput assaying includes diverse technologies such as microarray gene expression, miRNA expression, proteomics, DNA methylation, among many others. Now that these technologies have matured and become increasingly accessible, the next frontier is to collect multi-modal'' data for the same set of subjects and conduct integrative, multi-level analyses. While multi-modal data does contain distinct biological information that can be useful for answering complex biology questions, its value for predicting clinical phenotypes and contributions of each type of input remain unknown. We obtained 47 datasets/predictive tasks that in total span over 9 data modalities and executed analytic experiments for predicting various clinical phenotypes and outcomes. First, we analyzed each modality separately using uni-modal approaches based on several state-of-the-art supervised classification and feature selection methods. Then, we applied integrative multi-modal classification techniques. We have found that gene expression is the most predictively informative modality. Other modalities such as protein expression, miRNA expression, and DNA methylation also provide highly predictive results, which are often statistically comparable but not superior to gene expression data. Integrative multi-modal analyses generally do not increase predictive signal compared to gene expression data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据