4.1 Article

Dynameomics: design of a computational lab workflow and scientific data repository for protein simulations

期刊

PROTEIN ENGINEERING DESIGN & SELECTION
卷 21, 期 6, 页码 369-377

出版社

OXFORD UNIV PRESS
DOI: 10.1093/protein/gzn012

关键词

data warehouse; database; Dynameomics; OLAP; protein dynamics

资金

  1. NLM NIH HHS [3 T15 LM007442-04S1] Funding Source: Medline
  2. NATIONAL LIBRARY OF MEDICINE [T15LM007442] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Dynameomics is a project to investigate and catalog the native-state dynamics and thermal unfolding pathways of representatives of all protein folds using solvated molecular dynamics simulations, as described in the preceding paper. Here we introduce the design of the molecular dynamics data warehouse, a scalable, reliable repository that houses simulation data that vastly simplifies management and access. In the succeeding paper, we describe the development of a complementary multidimensional database. A single protein unfolding or native-state simulation can take weeks to months to complete, and produces gigabytes of coordinate and analysis data. Mining information from over 3000 completed simulations is complicated and time-consuming. Even the simplest queries involve writing intricate programs that must be built from low-level file system access primitives and include significant logic to correctly locate and parse data of interest. As a result, programs to answer questions that require data from hundreds of simulations are very difficult to write. Thus, organization and access to simulation data have been major obstacles to the discovery of new knowledge in the Dynameomics project. This repository is used internally and is the foundation of the Dynameomics portal site http://www.dynameomics.org. By organizing simulation data into a scalable, manageable and accessible form, we can begin to address substantial questions that move us closer to solving biomedical and bioengineering problems.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.1
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据