4.6 Article

Discovery of new STAT3 inhibitors as anticancer agents using ligand-receptor contact fingerprints and docking-augmented machine learning

Journal

RSC ADVANCES
Volume 13, Issue 7, Pages 4623-4640

Publisher

ROYAL SOC CHEMISTRY
DOI: 10.1039/d2ra07007c

Keywords

-

Ask authors/readers for more resources

STAT3 is a vital transcription factor that is present in high levels in various types of cancer. Inhibition of STAT3 is considered a promising anti-cancer strategy. This study used multiple docked poses of STAT3 inhibitors to enhance machine learning QSAR modeling and identified critical descriptors for anti-STAT3 bioactivity. Two successful pharmacophores were generated and tested against a cancer database, resulting in the discovery of three novel compounds with potent anti-STAT3 effects. The findings highlight the potential of data augmentation in machine learning models for compound discrimination.
STAT3 belongs to a family of seven vital transcription factors. High levels of STAT3 are detected in several types of cancer. Hence, STAT3 inhibition is considered a promising therapeutic anti-cancer strategy. In this work, we used multiple docked poses of STAT3 inhibitors to augment training data for machine learning QSAR modeling. Ligand-Receptor Contact Fingerprints and scoring values were implemented as descriptor variables. Escalating docking-scoring consensus levels were scanned against orthogonal machine learners, and the best learners (Random Forests and XGBoost) were coupled with genetic algorithm and Shapley additive explanations (SHAP) to identify critical descriptors that determine anti-STAT3 bioactivity to be translated into pharmacophore model(s). Two successful pharmacophores were deduced and subsequently used for in silico screening against the National Cancer Institute (NCI) database. A total of 26 hits were evaluated in vitro for their anti-STAT3 bioactivities. Out of which, three hits of novel chemotypes, showed cytotoxic IC50 values in the nanomolar range (35 nM to 6.7 mu M). However, two are potent dihydrofolate reductase (DHFR) inhibitors and therefore should have significant indirect STAT3 inhibitory effects. The third hit (cytotoxic IC50 = 0.44 mu M) is purely direct STAT3 inhibitor (devoid of DHFR activity) and caused, at its cytotoxic IC50, more than two-fold reduction in the expression of STAT3 downstream genes (c-Myc and Bcl-xL). The presented work indicates that the concept of data augmentation using multiple docked poses is a promising strategy for generating valid machine learning models capable of discriminating active from inactive compounds.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available