4.7 Article

CYPlebrity: Machine learning models for the prediction of inhibitors of cytochrome P450 enzymes

期刊

BIOORGANIC & MEDICINAL CHEMISTRY
卷 46, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.bmc.2021.116388

关键词

Drug metabolism; Cytochrome P450 inhibition; Metabolism prediction; Machine learning

资金

  1. Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) [KI 2085/1-1]
  2. Ministry of Education, Youth and Sports of the Czech Republic [LM2018130]

向作者/读者索取更多资源

The majority of approved drugs are metabolized by five major CYP isozymes, inhibition of these isozymes can lead to drug-drug interactions. There are numerous machine learning models available for predicting the inhibition of CYP isozymes. This study combines data from various sources to train classification models, achieving competitive performance and providing freely accessible models for academic research.
The vast majority of approved drugs are metabolized by the five major cytochrome P450 (CYP) isozymes, 1A2, 2C9, 2C19, 2D6 and 3A4. Inhibition of CYP isozymes can cause drug-drug interactions with severe pharmacological and toxicological consequences. Computational methods for the fast and reliable prediction of the inhibition of CYP isozymes by small molecules are therefore of high interest and relevance to pharmaceutical companies and a host of other industries, including the cosmetics and agrochemical industries. Today, a large number of machine learning models for predicting the inhibition of the major CYP isozymes by small molecules are available. With this work we aim to go beyond the coverage of existing models, by combining data from several major public and proprietary sources. More specifically, we used up to 18815 compounds with measured bioactivities to train random forest classification models for the individual CYP isozymes. A major advantage of the new data collection over existing ones is the better representation of the minority class, the CYP inhibitors. With the new data collection we achieved inhibitor-to-non-inhibitor ratios in the order of 1:1 (CYP1A2) to 1:3 (CYP2D6). We show that our models reach competitive performance on external data, with Matthews correlation coefficients (MCCs) ranging from 0.62 (CYP2C19) to 0.70 (CYP2D6), and areas under the receiver operating characteristic curve (AUCs) between 0.89 (CYP2C19) and 0.92 (CYPs 2D6 and 3A4). Importantly, the models show a high level of robustness, reflected in a good predictivity also for compounds that are structurally dissimilar to the compounds represented in the training data. The best models presented in this work are freely accessible for academic research via a web service.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据