☆ 4.6 Article

Predicting aggregate morphology of sequence-defined macromolecules with recurrent neural networks

SOFT MATTER (2022)

期刊

SOFT MATTER

卷 18, 期 27, 页码 5037-5051

出版社

ROYAL SOC CHEMISTRY

DOI: 10.1039/d2sm00452f

关键词

类别

Chemistry, Physical Materials Science, Multidisciplinary Physics, Multidisciplinary Polymer Science

资金

Department of Materials Science and Engineering at the Pennsylvania State University
Institute for Computational and Data Sciences at the Pennsylvania State University
Materials Research Institute at the Pennsylvania State University

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Self-assembly of dilute sequence-defined macromolecules is a complex phenomenon with a dependence on the sequence. In this study, supervised machine learning was used to accurately predict the morphology of aggregates formed by these macromolecules. High-throughput screening was also demonstrated to identify suitable sequences for self-assembly.

Self-assembly of dilute sequence-defined macromolecules is a complex phenomenon in which the local arrangement of chemical moieties can lead to the formation of long-range structure. The dependence of this structure on the sequence necessarily implies that a mapping between the two exists, yet it has been difficult to model so far. Predicting the aggregation behavior of these macromolecules is challenging due to the lack of effective order parameters, a vast design space, inherent variability, and high computational costs associated with currently available simulation techniques. Here, we accurately predict the morphology of aggregates self-assembled from sequence-defined macromolecules using supervised machine learning. We find that regression models with implicit representation learning perform significantly better than those based on engineered features such as k-mer counting, and a recurrent-neural-network-based regressor performs the best out of nine model architectures we tested. Furthermore, we demonstrate the high-throughput screening of monomer sequences using the regression model to identify candidates for self-assembly into selected morphologies. Our strategy is shown to successfully identify multiple suitable sequences in every test we performed, so we hope the insights gained here can be extended to other increasingly complex design scenarios in the future, such as the design of sequences under polydispersity and at varying environmental conditions.

Predicting aggregate morphology of sequence-defined macromolecules with recurrent neural networks

期刊

SOFT MATTER

出版社

ROYAL SOC CHEMISTRY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Predicting aggregate morphology of sequence-defined macromolecules with recurrent neural networks

期刊

SOFT MATTER

出版社

ROYAL SOC CHEMISTRY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文