4.6 Article

SimBoost: a read-across approach for predicting drug-target binding affinities using gradient boosting machines

Journal

JOURNAL OF CHEMINFORMATICS
Volume 9, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/s13321-017-0209-z

Keywords

Read-across; Gradient boosting; Drug-target interaction; Prediction interval; Applicability Domain; QSAR

Funding

  1. NSERC Create program Computational Methods for the Analysis of the Diversity and Dynamics of Genomes [433905-2013]
  2. NSERC Discovery

Ask authors/readers for more resources

Computational prediction of the interaction between drugs and targets is a standing challenge in the field of drug discovery. A number of rather accurate predictions were reported for various binary drug-target benchmark data-sets. However, a notable drawback of a binary representation of interaction data is that missing endpoints for non-interacting drug-target pairs are not differentiated from inactive cases, and that predicted levels of activity depend on pre-defined binarization thresholds. In this paper, we present a method called SimBoost that predicts continuous (non-binary) values of binding affinities of compounds and proteins and thus incorporates the whole interaction spectrum from true negative to true positive interactions. Additionally, we propose a version of the method called SimBoostQuant which computes a prediction interval in order to assess the confidence of the predicted affinity, thus defining the Applicability Domain metrics explicitly. We evaluate SimBoost and SimBoostQuant on two established drug-target interaction benchmark datasets and one new dataset that we propose to use as a benchmark for read-across cheminformatics applications. We demonstrate that our methods outperform the previously reported models across the studied datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available