4.4 Article

Augmenting bioactivity by docking-generated multiple ligand poses to enhance machine learning and pharmacophore modelling: discovery of new TTK inhibitors as case study

Journal

MOLECULAR INFORMATICS
Volume 42, Issue 6, Pages -

Publisher

WILEY-V C H VERLAG GMBH
DOI: 10.1002/minf.202300022

Keywords

docking; machine learning; QSAR; scoring; Shapley values; TTK

Ask authors/readers for more resources

This study used multiple docked poses of TTK inhibitors to augment training data for machine learning QSAR modeling. Critical descriptors for predicting anti-TTK bioactivity and for pharmacophore generation were determined, and three successful pharmacophores were deduced. These pharmacophores were then used for in silico screening against the NCI database, resulting in the evaluation of 14 hits with one compound showing reasonable dose-response curve and experimental IC50 of 1.0μM.
Dual specificity protein kinase threonine/Tyrosine kinase (TTK) is one of the mitotic kinases. High levels of TTK are detected in several types of cancer. Hence, TTK inhibition is considered a promising therapeutic anti-cancer strategy. In this work, we used multiple docked poses of TTK inhibitors to augment training data for machine learning QSAR modeling. Ligand-Receptor Contacts Fingerprints and docking scoring values were used as descriptor variables. Escalating docking-scoring consensus levels were scanned against orthogonal machine learners, and the best learners (Random Forests and XGBoost) were coupled with genetic algorithm and Shapley additive explanations (SHAP) to determine critical descriptors for predicting anti-TTK bioactivity and for pharmacophore generation. Three successful pharmacophores were deduced and subsequently used for in silico screening against the NCI database. A total of 14 hits were evaluated in vitro for their anti-TTK bioactivities. One hit of novel chemotype showed reasonable dose-response curve with experimental IC50 of 1.0 mu M. The presented work indicates the validity of data augmentation using multiple docked poses for building successful machine learning models and pharmacophore hypotheses.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available