4.6 Review

Data Lakes, Clouds, and Commons: A Review of Platforms for Analyzing and Sharing Genomic Data

Journal

TRENDS IN GENETICS
Volume 35, Issue 3, Pages 223-234

Publisher

ELSEVIER SCIENCE LONDON
DOI: 10.1016/j.tig.2018.12.006

Funding

  1. NCI, NIH [17X053, 14X050, HHSN261200800001E]

Data commons collate data with cloud computing infrastructure and commonly used software services, tools, and applications to create biomedical resources for the large-scale management, analysis, harmonization, and sharing of biomedical data. Over the past few years, data commons have been used to analyze, harmonize, and share large-scale genomics datasets. Data ecosystems can be built by interoperating multiple data commons. It can be quite labor intensive to curate, import, and analyze the data in a data commons. Data lakes provide an alternative to data commons and simply provide access to data, with the data curation and analysis deferred until later and delegated to those that access the data. We review software platforms for managing, analyzing, and sharing genomic data, with an emphasis on data commons, but also cover data ecosystems and data lakes.
