4.7 Article

Identifying Structure-Property Relationships through SMILES Syntax Analysis with Self-Attention Mechanism

期刊

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.8b00803

关键词

-

资金

  1. the Science & Technology Program of Guangzhou [201604020109]
  2. Science & Technology Planning Project of Guangdong Province [2016A020217005]
  3. GD Frontier & Key Techn. Innovation Program [2015B010109004]
  4. GD-NSF [2016A030310228]
  5. National Key R&D Program of China [2017YFB02034043]
  6. Guangdong Provincial Key Lab. of Construction Foundation [2011A060901014]
  7. Natural Science Foundation of China [U1611261, 61772566]
  8. program for Guangdong Introducing Innovative and Entrepreneurial Teams [2016ZT06D211]

向作者/读者索取更多资源

Recognizing substructures and their relations embedded in a molecular structure representation is a key process for structure-activity or structure-property relationship (SAR/SPR) studies. A molecular structure can be explicitly represented as either a connection table (CT) or linear notation, such as SMILES, which is a language describing the connectivity of atoms in the molecular structure. Conventional SAR/SPR approaches rely on partitioning the CT into a set of predefined substructures as structural descriptors. In this work, we propose a new method to identifying SAR/SPR through linear notation (for example, SMILES) syntax analysis with self-attention mechanism, an interpretable deep learning architecture. The method has been evaluated by predicting chemical properties, toxicology, and bioactivity from experimental data sets. Our results demonstrate that the method yields superior performance compared with state-of-the-art models. Moreover, the method can produce chemically interpretable results, which can be used for a chemist to design and synthesize the activity- or property-improved compounds.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据