4.5 Review

Reprint of Abstraction for data integration: Fusing mammalian molecular, cellular and phenotype big datasets for better knowledge extraction

Journal

COMPUTATIONAL BIOLOGY AND CHEMISTRY
Volume 59, Issue -, Pages 123-138

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.compbiolchem.2015.08.005

Keywords

Data integration; Bioinformatics; Systems biology; Systems pharmacology; Network biology

Funding

  1. NIH [U54HL127624, U54CA189201, R01GM098316, T32HL007824]

Ask authors/readers for more resources

With advances in genomics, transcriptomics, metabolomics and proteomics, and more expansive electronic clinical record monitoring, as well as advances in computation, we have entered the Big Data era in biomedical research. Data gathering is growing rapidly while only a small fraction of this data is converted to useful knowledge or reused in future studies. To improve this, an important concept that is often overlooked is data abstraction. To fuse and reuse biomedical datasets from diverse resources, data abstraction is frequently required. Here we summarize some of the major Big Data biomedical research resources for genomics, proteomics and phenotype data, collected from mammalian cells, tissues and organisms. We then suggest simple data abstraction methods for fusing this diverse but related data. Finally, we demonstrate examples of the potential utility of such data integration efforts, while warning about the inherit biases that exist within such data. (C) 2015 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available