4.7 Article

Comparison of Chemical Structure and Cell Morphology Information for Multitask Bioactivity Predictions

期刊

JOURNAL OF CHEMICAL INFORMATION AND MODELING
卷 61, 期 3, 页码 1444-1456

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.0c00864

关键词

-

资金

  1. Biotechnology and Biological Sciences Research Council (BBSRC) [BB/M011194/1]
  2. AstraZeneca
  3. BBSRC [1944644] Funding Source: UKRI

向作者/读者索取更多资源

Understanding the mechanism-of-action of compounds and predicting potential drug targets are crucial in small-molecule drug discovery. This study compared chemical and cell morphology information for bioactivity prediction, finding that BMF Macau outperformed RF when image data were used as compound information. Both cell morphology changes and chemical structure information contain valuable information about compound bioactivity, with some targets better predicted by image data and others by fingerprint-based side information.
The understanding of the mechanism-of-action (MoA) of compounds and the prediction of potential drug targets play an important role in small-molecule drug discovery. The aim of this work was to compare chemical and cell morphology information for bioactivity prediction. The comparison was performed using bioactivity data from the ExCAPE database, image data (in the form of CellProfiler features) from the Cell Painting data set (the largest publicly available data set of cell images with similar to 30,000 compound perturbations), and extended connectivity fingerprints (ECFPs) using the multitask Bayesian matrix factorization (BMF) approach Macau. We found that the BMF Macau and random forest (RF) performance were overall similar when ECFPs were used as compound descriptors. However, BMF Macau outperformed RF in 159 out of 224 targets (71%) when image data were used as compound information. Using BMF Macau, 100 (corresponding to about 45%) and 90 (about 40%) of the 224 targets were predicted with high predictive performance (AUC > 0.8) with ECFP data and image data as side information, respectively. There were targets better predicted by image data as side information, such as beta-catenin, and others better predicted by fingerprint-based side information, such as proteins belonging to the G-protein-Coupled Receptor 1 family, which could be rationalized from the underlying data distributions in each descriptor domain. In conclusion, both cell morphology changes and chemical structure information contain information about compound bioactivity, which is also partially complementary, and can hence contribute to in silico MoA analysis.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据