4.6 Article

Information-theoretical measures identify accurate low-resolution representations of protein configurational space

期刊

SOFT MATTER
卷 18, 期 37, 页码 7064-7074

出版社

ROYAL SOC CHEMISTRY
DOI: 10.1039/d2sm00636g

关键词

-

资金

  1. Frankfurt Institute for Advanced Studies
  2. LOEWE Center for Multiscale Modelling in Life Sciences of the state of Hesse
  3. European Research Council (ERC) under the European Union [758588]
  4. European Research Council (ERC) [758588] Funding Source: European Research Council (ERC)

向作者/读者索取更多资源

This study addresses the problem of coarsening the conformational space of proteins in molecular dynamics simulations. Using an information-theoretical framework, the researchers identified the optimal resolution level that balances simplicity and informativeness, and discovered that clustering methods inducing an ultrametric structure in the low-resolution space are the most physically accurate. The proposed strategy has applicability beyond computational biophysics and can be a valuable tool for extracting useful information from large datasets.
The steadily growing computational power employed to perform molecular dynamics simulations of biological macromolecules represents at the same time an immense opportunity and a formidable challenge. In fact, large amounts of data are produced, from which useful, synthetic, and intelligible information has to be extracted to make the crucial step from knowing to understanding. Here we tackled the problem of coarsening the conformational space sampled by proteins in the course of molecular dynamics simulations. We applied different schemes to cluster the frames of a dataset of protein simulations; we then employed an information-theoretical framework, based on the notion of resolution and relevance, to gauge how well the various clustering methods accomplish this simplification of the configurational space. Our approach allowed us to identify the level of resolution that optimally balances simplicity and informativeness; furthermore, we found that the most physically accurate clustering procedures are those that induce an ultrametric structure of the low-resolution space, consistently with the hypothesis that the protein conformational landscape has a self-similar organisation. The proposed strategy is general and its applicability extends beyond that of computational biophysics, making it a valuable tool to extract useful information from large datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据