4.8 Article

Out-of-the-box deep learning prediction of pharmaceutical properties by broadly learned knowledge-based molecular representations

Journal

NATURE MACHINE INTELLIGENCE
Volume 3, Issue 4, Pages 334-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s42256-021-00301-6

Keywords

-

Funding

  1. National Key R&D Program of China, Synthetic Biology Research [2019YFA0905900]
  2. Shenzhen Municipal Government grants [2019156, JCYJ20170413113448742, 201901]
  3. Department of Science and Technology of Guangdong Province [2017B030314083]
  4. Singapore Academic Funds [R-148-000-273-114]

Ask authors/readers for more resources

The study demonstrates that combining human knowledge-based molecular representations with convolutional neural networks can enhance deep learning of pharmaceutical properties. By extensively learning molecular descriptors and fingerprint features, the MolMap method was developed and MolMapNet models were constructed, which outperformed other models on various benchmark datasets and a novel dataset.
Successful deep learning critically depends on the representation of the learned objects. Recent state-of-the-art pharmaceutical deep learning models successfully exploit graph-based de novo learning of molecular representations. Nonetheless, the combined potential of human expert knowledge of molecular representations and convolution neural networks has not been adequately explored for enhanced learning of pharmaceutical properties. Here we show that broader exploration of human-knowledge-based molecular representations enables more enhanced deep learning of pharmaceutical properties. By broad learning of 1,456 molecular descriptors and 16,204 fingerprint features of 8,506,205 molecules, a new feature-generation method MolMap was developed for mapping these molecular descriptors and fingerprint features into robust two-dimensional feature maps. Convolution-neural-network-based MolMapNet models were constructed for out-of-the-box deep learning of pharmaceutical properties, which outperformed the graph-based and other established models on most of the 26 pharmaceutically relevant benchmark datasets and a novel dataset. The MolMapNet learned important features that are consistent with the literature-reported molecular features. While deep learning models have allowed the extraction of fingerprints from the structural description of molecules, they can miss information that is present in the molecular descriptors that chemists use. Shen and colleagues present a method to combine both sources of information into two-dimensional fingerprint maps, which can be used in a wide variety of pharmaceutical tasks to predict the properties of drugs.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available