4.3 Article

Consensus models to predict oral rat acute toxicity and validation on a dataset coming from the industrial context

期刊

SAR AND QSAR IN ENVIRONMENTAL RESEARCH
卷 30, 期 12, 页码 879-897

出版社

TAYLOR & FRANCIS LTD
DOI: 10.1080/1062936X.2019.1672089

关键词

QSAR; QSPR; generative topographic mapping (GTM); oral rat acute toxicity; OECD principles; REACH

向作者/读者索取更多资源

We report predictive models of acute oral systemic toxicity representing a follow-up of our previous work in the framework of the NICEATM project. It includes the update of original models through the addition of new data and an external validation of the models using a dataset relevant for the chemical industry context. A regression model for LD50 and multi-class classification model for toxicity classes according to the Global Harmonized System categories were prepared. ISIDA descriptors were used to encode molecular structures. Machine learning algorithms included support vector machine (SVM), random forest (RF) and na?ve Bayesian. Selected individual models were combined in consensus. The different datasets were compared using the generative topographic mapping approach. It appeared that the NICEATM datasets were lacking some relevant chemotypes for chemical industry. The new models trained on enlarged data sets have applicability domains (AD) sufficiently large to accommodate industrial compounds. The fraction of compounds inside the models? AD increased from 58% (NICEATM model) to 94% (new model). The increase of training sets improved models? prediction performance: RMSE values decreased from 0.56 to 0.47 and balanced accuracies increased from 0.69 to 0.71 for NICEATM and new models, respectively.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据