4.7 Article

Predicting emerging chemical content in consumer products using machine learning

期刊

SCIENCE OF THE TOTAL ENVIRONMENT
卷 834, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.scitotenv.2022.154849

关键词

Exposure modeling; Chemical function; Nanomaterials; Arti ficial intelligence; Cheminformatics; Environmental exposure; Consumer product safety; Nanotechnology

资金

  1. National Science Foundation (NSF) [DGE-2022040]
  2. EPA [EF-0830093, DBI-1266252]
  3. EPA under NSF

向作者/读者索取更多资源

This study evaluated the utility of machine learning strategies for predicting weight fractions in products with limited chemical constituent data. The findings suggest that functional use categories alongside chemical property data can improve predictive performance, leading to stratifying material-product observations into order of magnitude weight fractions with moderate success.
their consequent risk, we need to know their concentrations in products, or chemical weight fractions. Unfortunately, manufacturers rarely report comprehensive weight fraction data on product labels. The goal of this study was to evaluate the utility of machine learning strategies for predicting weight fractions when chemical constituent data are limited. A data-poor framework was developed and tested using a small dataset on consumer products containing engineered nanomaterials to represent emerging substances. A second, more traditional framework was applied to a data-rich product dataset comprised of bulk-scale organic chemicals for comparison purposes. Feature variables included chemical properties, functional use categories (e.g., antimicrobial), product categories (e.g., makeup), product matrix categories, and whether weight fractions were manufacturer-reported or experimentally obtained. Classification into three weight fraction bins was done using a random forest or nonlinear support vector classifier. An ablation study revealed that functional use data improved predictive performance when included alongside chemical property data, suggesting the utility of functional use categories in evaluating the safety and sustainability of emerging chemicals. Models could roughly stratify material-product observations into order of magnitude weight fractions with moderate success; the best of these achieved an average balanced accuracy of 73% on the nanomaterials product data. Framework comparisons also revealed a positive trend in sample size versus average balanced accuracy, suggesting great promise for machine learning approaches with continued investment in chemical data collection.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据