4.8 Review

Unsupervised Learning Methods for Molecular Simulation Data

期刊

CHEMICAL REVIEWS
卷 121, 期 16, 页码 9722-9758

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.chemrev.0c01195

关键词

-

资金

  1. European Union's Horizon 2020 research and innovation program [824143]
  2. European Commission [ERC CoG 772230]
  3. MATH+, the Berlin Mathematics Center [EF1-2]
  4. BMBF (Berlin Institute for the Foundations of Learning and Data - BIFOLD)
  5. National Science Foundation [CHE-1738990, CHE-1900374, PHY-1427654]
  6. Welch Foundation [C-1570]
  7. Deutsche Forschungsgemeinschaft [SFB/TRR 186/A12, SFB 1078/C7]
  8. Einstein Foundation Berlin

向作者/读者索取更多资源

This review provides a comprehensive overview of the importance of unsupervised learning in analyzing data from atomistic and molecular simulations, discussing state-of-the-art algorithms and methods in feature representation, dimensionality reduction, density estimation, clustering, and kinetic models. The article is well-structured, with detailed discussions in each section on the mathematical and algorithmic foundations, strengths and limitations of each method, and specific applications in analyzing molecular simulation data.
Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicate likely directions for further developments in the field. In particular, we discuss feature representation of molecular systems and present state-of-the-art algorithms of dimensionality reduction, density estimation, and clustering, and kinetic models. We divide our discussion into self-contained sections, each discussing a specific method. In each section, we briefly touch upon the mathematical and algorithmic foundations of the method, highlight its strengths and limitations, and describe the specific ways in which it has been used-or can be used-to analyze molecular simulation data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据