4.2 Article

In silico prediction of the full United Nations Globally Harmonized System eye irritation categories of liquid chemicals by IATA-like bottom-up approach of random forest method

出版社

TAYLOR & FRANCIS INC
DOI: 10.1080/15287394.2021.1956661

关键词

Eye irritation potential; machine-learning; physicochemical descriptor; random forest; in silico

资金

  1. Ministry of Science and ICT, South Korea [2018R1A5A2025286]
  2. Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI) - Ministry of Health and Welfare, Republic of Korea [HP20C0061]

向作者/读者索取更多资源

This study aimed to construct an in silico model to predict the eye irritation category of liquid chemicals, and achieved high accuracy in ternary categorization of eye irritation potential with a two-stage random forest approach. The prediction model showed excellent performance in distinguishing Category 1 and Category 2 chemicals.
As an alternative to in vivo Draize rabbit eye irritation test, this study aimed to construct an in silico model to predict the complete United Nations (UN) Globally Harmonized System (GHS) for classification and labeling of chemicals for eye irritation category [eye damage (Category 1), irritating to eye (Category 2) and nonirritating (No category)] of liquid chemicals with Integrated approaches to testing and assessment (IATA)-like two-stage random forest approach. Liquid chemicals (n = 219) with 34 physicochemical descriptors and quality in vivo data were collected with no missing values. Seven machine learning algorithms (Naive Bayes, Logistic Regression, First Large Margin, Neural Net, Random Forest (RF), Gradient Boosted Tree, and Support Vector Machine) were examined for the ternary categorization of eye irritation potential at a single run through 10-fold cross-validation. RF, which performed best, was further improved by applying the 'Bottom-up approach' concept of IATA, namely, separating No category first, and discriminating Category 1 from 2, thereafter. The best performing training dataset achieved an overall accuracy of 73% and the correct prediction for Category 1, 2, and No category was 80%, 50%, and 77%, respectively for the test dataset. This prediction model was further validated with an external dataset of 28 chemicals, for which an overall accuracy of 71% was achieved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据