4.7 Article

Combining multi-dimensional molecular fingerprints to predict the hERG cardiotoxicity of compounds

Journal

COMPUTERS IN BIOLOGY AND MEDICINE
Volume 144, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compbiomed.2022.105390

Keywords

Molecular fingerprint; Machine learning; hERG; Ensemble; Prediction models

Funding

  1. National Natural Science Founda-tion of China [82003655]
  2. Liaoning Province Reju-venating Talents Plan [XLYC2002045]
  3. Key R&D Program of Liaoning Province [2019JH2/10300041]
  4. Scientific Research Project from Department of Education of Liaoning Province [LJKZ0088, LQN201906]
  5. Shenyang Science and Technology Talent Project

Ask authors/readers for more resources

Recently, drug toxicity has become a critical problem with heavy medical and economic burdens. In this study, multi-dimensional molecular fingerprints and consensus models were used to predict hERG cardiotoxicity. The results showed that the model outperformed recently published methods, indicating high predictive accuracy.
Recently, drug toxicity has become a critical problem with heavy medical and economic burdens. Acquired long QT syndrome (acLQTS) is an acquired cardiac ion channel disease caused by drugs blocking the hERG channel. Therefore, it is necessary to avoid cardiotoxicity in drug design, and computer models have been widely used to fix this predicament. In this study, we collected a hERG inhibitor dataset containing 8671 compounds, and then, these compounds were featurized by traditional molecular fingerprints (including Baseline2D, ECFP4, PropertyFP, and 3DFP) and the newly proposed molecular dynamics fingerprint (MDFP). Subsequently, regression prediction models were established by using four machine learning algorithms based on these fingerprints and the combined multi-dimensional molecular fingerprints (MultiFP). After cross-validation and independent test dataset validation, the results show that the best model was built by the consensus of four algorithms with MultiFP, and this model bests recently published methods in terms of hERG cardiotoxicity prediction with a RMSE of 0.531 and a R-2 of 0.653 on the test dataset. Feature importance analysis and correlation analysis identified some novel structural features and molecular dynamics features that are highly associated with the hERG inhibition of compounds. Our findings provide new insight into multi-dimensional molecular fingerprints and consensus models for hERG cardiotoxicity prediction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available