4.7 Article

Application of machine learning to predict the inhibitory activity of organic chemicals on thyroid stimulating hormone receptor

Journal

ENVIRONMENTAL RESEARCH
Volume 212, Issue -, Pages -

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.envres.2022.113175

Keywords

Bayesian optimization; Imbalanced data; Molecular docking; QSAR

Funding

  1. National Natural Science Foundation of China [21777022]
  2. Fundamental Research Funds for the Central Universities [2412018ZD014]

Ask authors/readers for more resources

With the promotion of carbon neutrality, it is important to simultaneously focus on the assessment and sustainable management of chemicals to protect public health. This study successfully established a predictive model based on compounds derived from cyclic adenosine monophosphate (cAMP) analysis, effectively identifying and predicting chemicals and active compounds that inhibit the thyroid stimulating hormone receptor (TSHR).
With the promotion of carbon neutrality, it is also important to synchronously promote the assessment and sustainable management of chemicals so as to protect public health. Humans and animals are possibly exposed to endocrine disruptors that have inhibitory effects on thyroid stimulating hormone receptor (TSHR). As such, it is important to identify chemicals that inhibit TSHR and to develop models to predict their inhibitory activity. In this study, 5952 compounds derived from a cyclic adenosine monophosphate (cAMP) analysis, a key signaling pathway in thyrocytes, were used to establish a binary classification model comparing methods that included random forest (RF), extreme gradient boosting (XGB), and logistic regression (LR). The prediction model based on RF showed the highest identification accuracy for revealing chemicals that may inhibit TSHR. For the RF model, recall was calculated at 0.89, balance accuracy was 0.85, and its receiver operating characteristic (ROC) curve-area under (AUC) was 0.92, indicating that the model had very high predictive capacity. The lowest CDocker energy (CE) and CDocker interaction energy (CIE) for chemicals and TSHR were determined and were subsequently introduced into the predictive model as descriptors. A regression model, extreme gradient boostingRegression (XGBR), was successfully established yielding an R2 = 0.65 to predict inhibitory activity for active compounds. Parameters that included dissociation characteristics, molecular structure, and binding energy were all key factors in the predictive model. We demonstrate that QSAR models are useful approaches, not only for identifying chemicals that inhibit TSHR, but for predicting inhibitory activity of active compounds.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available