4.8 Article

Bioactivity descriptors for uncharacterized chemical compounds

Journal

NATURE COMMUNICATIONS
Volume 12, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41467-021-24150-4

Keywords

-

Funding

  1. Generalitat de Catalunya [CECH: 001-P-001682, VEIS: 001-P-001647]
  2. Spanish Ministerio de Economia y Competitividad [BIO2016-77038-R]
  3. European Research Council [SysPharmAD: 614944]
  4. European Commission [RiPCoN: 101003633]
  5. Agencia Estatal de Investigacion (AEI)
  6. Fondos FEDER [PID2019-104698RB-I00]

Ask authors/readers for more resources

Chemical descriptors encode the physicochemical and structural properties of small molecules and are at the core of chemoinformatics. The authors introduce deep neural networks capable of inferring bioactivity signatures for any compound of interest, even with little or no experimental information. Signatures relate to 25 different types of bioactivities and can be used as replacements for chemical descriptors in daily chemoinformatics tasks.
Chemical descriptors encode the physicochemical and structural properties of small molecules, and they are at the core of chemoinformatics. The broad release of bioactivity data has prompted enriched representations of compounds, reaching beyond chemical structures and capturing their known biological properties. Unfortunately, bioactivity descriptors are not available for most small molecules, which limits their applicability to a few thousand well characterized compounds. Here we present a collection of deep neural networks able to infer bioactivity signatures for any compound of interest, even when little or no experimental information is available for them. Our signaturizers relate to bioactivities of 25 different types (including target profiles, cellular response and clinical outcomes) and can be used as drop-in replacements for chemical descriptors in day-to-day chemoinformatics tasks. Indeed, we illustrate how inferred bioactivity signatures are useful to navigate the chemical space in a biologically relevant manner, unveiling higher-order organization in natural product collections, and to enrich mostly uncharacterized chemical libraries for activity against the drug-orphan target Snail1. Moreover, we implement a battery of signature-activity relationship (SigAR) models and show a substantial improvement in performance, with respect to chemistry-based classifiers, across a series of biophysics and physiology activity prediction benchmarks. Small molecules bioactivity descriptors are enriched representations of compounds, reaching beyond chemical structures and capturing their known biological properties. Here the authors present a collection of deep neural networks able to infer bioactivity signatures for any compound of interest, even when little or no experimental information is available for them.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available