4.7 Article

pdCSM-cancer: Using Graph-Based Signatures to Identify Small Molecules with Anticancer Properties

期刊

JOURNAL OF CHEMICAL INFORMATION AND MODELING
卷 61, 期 7, 页码 3314-3322

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.1c00168

关键词

-

资金

  1. Kingdom of Saudi Arabia
  2. Joe White Bequest Fellowship
  3. Medical Research Council [MR/M026302/1]
  4. National Health and Medical Research Council of Australia [GNT1174405]
  5. Wellcome Trust [093167/Z/10/Z]
  6. Jack Brockhoff Foundation [JBF 4186]
  7. Victorian Government's Operational Infrastructure Support Program
  8. MRC [MR/M026302/1] Funding Source: UKRI
  9. Wellcome Trust [093167/Z/10/Z] Funding Source: Wellcome Trust

向作者/读者索取更多资源

The development of a novel predictive tool, pdCSM-cancer, provides an accurate way to predict molecules likely to be active against one or multiple cancer cell lines by using graph-based chemical structure signatures. This tool includes trained and validated models on data from over 18,000 compounds on 9 tumor types and 74 distinct cancer cell lines, achieving successful predictive performance.
The development of new, effective, and safe drugs to treat cancer remains a challenging and time-consuming task due to limited hit rates, restraining subsequent development efforts. Despite the impressive progress of quantitative structure-activity relationship and machine learning-based models that have been developed to predict molecule pharmacodynamics and bioactivity, they have had mixed success at identifying compounds with anticancer properties against multiple cell lines. Here, we have developed a novel predictive tool, pdCSM-cancer, which uses a graph-based signature representation of the chemical structure of a small molecule in order to accurately predict molecules likely to be active against one or multiple cancer cell lines. pdCSM-cancer represents the most comprehensive anticancer bioactivity prediction platform developed till date, comprising trained and validated models on experimental data of the growth inhibition concentration (GI50%) effects, including over 18,000 compounds, on 9 tumor types and 74 distinct cancer cell lines. Across 10-fold cross-validation, it achieved Pearson's correlation coefficients of up to 0.74 and comparable performance of up to 0.67 across independent, non-redundant blind tests. Leveraging the insights from these cell line-specific models, we developed a generic predictive model to identify molecules active in at least 60 cell lines. Our final model achieved an area under the receiver operating characteristic curve (AUC) of up to 0.94 on 10-fold cross-validation and up to 0.94 on independent non-redundant blind tests, outperforming alternative approaches. We believe that our predictive tool will provide a valuable resource to optimizing and enriching screening libraries for the identification of effective and safe anticancer molecules. To provide a simple and integrated platform to rapidly screen for potential biologically active molecules with favorable anticancer properties, we made pdCSM-cancer freely available online at http://blosig.unimetb.edu.au/pdcsm_cancer.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据