4.7 Article

Discriminant models on mitochondrial toxicity improved by consensus modeling and resolving imbalance in training

期刊

CHEMOSPHERE
卷 253, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.chemosphere.2020.126768

关键词

Mitochondrial toxicity; QSAR; Machine learning; Consensus model; Imbalanced data

资金

  1. National Key R&D Program of China [2018YFC1801604, 2018YFE0110700]
  2. National Natural Science Foundation of China [21661142001]

向作者/读者索取更多资源

Humans and animals may be exposed to tens of thousands of natural and synthetic chemicals during their lifespan. It is difficult to assess risk for all the chemicals with experimental toxicity tests. An alternative approach is to use computational toxicology methods such as quantitative structure-activity relationship (QSAR) modeling. Mitochondrial toxicity is involved in many diseases such as cancer, neurodegeneration, type 2 diabetes, cardiovascular diseases and autoimmune diseases. Thus, it is important to rapidly and efficiently identify chemicals with mitochondrial toxicity. In this study, five machine learning algorithms and twelve types of molecular fingerprints were employed to generate QSAR discriminant models for mitochondrial toxicity. A threshold moving method was adopted to resolve the imbalance issue in the training data. Consensus of the models by an averaging probability strategy improved prediction performance. The best model has correct classification rates of 81.8% and 88.3% in ten-fold cross validation and external validation, respectively. Substructures such as phenol, carboxylic acid, nitro and arylchloride were found informative through analysis of information gain and frequency of substructures. The results demonstrate that resolving imbalance in training and building consensus models can improve classification rates for mitochondrial toxicity prediction. (C) 2020 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据