4.7 Article

Toward more realistic drug-target interaction predictions

期刊

BRIEFINGS IN BIOINFORMATICS
卷 16, 期 2, 页码 325-337

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbu010

关键词

drug-target interaction; kinase bioactivity assays; nested cross-validation; predictive modeling; supervised machine learning

资金

  1. Academy of Finland [120569, 133227, 140880, 269862, 272437, 134020]
  2. Centre for International Mobility (CIMO)
  3. Helsinki Biomedical Graduate Program (HBGP)
  4. Academy of Finland (AKA) [140880, 133227, 140880, 133227] Funding Source: Academy of Finland (AKA)

向作者/读者索取更多资源

A number of supervised machine learning models have recently been introduced for the prediction of drug-target interactions based on chemical structure and genomic sequence information. Although these models could offer improved means for many network pharmacology applications, such as repositioning of drugs for new therapeutic uses, the prediction models are often being constructed and evaluated under overly simplified settings that do not reflect the real-life problem in practical applications. Using quantitative drug-target bioactivity assays for kinase inhibitors, as well as a popular benchmarking data set of binary drug-target interactions for enzyme, ion channel, nuclear receptor and G protein-coupled receptor targets, we illustrate here the effects of four factors that may lead to dramatic differences in the prediction results: (i) problem formulation (standard binary classification or more realistic regression formulation), (ii) evaluation data set (drug and target families in the application use case), (iii) evaluation procedure (simple or nested cross-validation) and (iv) experimental setting (whether training and test sets share common drugs and targets, only drugs or targets or neither). Each of these factors should be taken into consideration to avoid reporting overoptimistic drug-target interaction prediction results. We also suggest guidelines on how to make the supervised drug-target interaction prediction studies more realistic in terms of such model formulations and evaluation setups that better address the inherent complexity of the prediction task in the practical applications, as well as novel benchmarking data sets that capture the continuous nature of the drug-target interactions for kinase inhibitors.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据