4.5 Article

UMAP as a Dimensionality Reduction Tool for Molecular Dynamics Simulations of Biomacromolecules: A Comparison Study

期刊

JOURNAL OF PHYSICAL CHEMISTRY B
卷 125, 期 19, 页码 5022-5034

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jpcb.1c02081

关键词

-

资金

  1. National Institute of General Medical Sciences of the National Institutes of Health [R15GM122013]

向作者/读者索取更多资源

This study compared the performance of different dimensionality reduction methods in analyzing molecular dynamics simulations of the circadian clock protein VIVID, showing that UMAP outperforms linear methods like PCA and tICA in accurately representing data structures with competitive performance and scalable computational cost.
Proteins are the molecular machines of life. The multitude of possible conformations that proteins can adopt determines their free-energy landscapes. However, the inherently high dimensionality of a protein free-energy landscape poses a challenge to deciphering how proteins perform their functions. For this reason, dimensionality reduction is an active field of research for molecular biologists. The uniform manifold approximation and projection (UMAP) is a dimensionality reduction method based on a fuzzy topological analysis of data. In the present study, the performance of UMAP is compared with that of other popular dimensionality reduction methods such as t-distributed stochastic neighbor embedding (t-SNE), principal component analysis (PCA), and time-structure independent components analysis (tICA) in the context of analyzing molecular dynamics simulations of the circadian clock protein VIVID. A good dimensionality reduction method should accurately represent the data structure on the projected components. The comparison of the raw high-dimensional data with the projections obtained using different dimensionality reduction methods based on various metrics showed that UMAP has superior performance when compared with linear reduction methods (PCA and tICA) and has competitive performance and scalable computational cost.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据