4.8 Article

Impact of atomistic or crystallographic descriptors for classification of gold nanoparticles

期刊

NANOSCALE
卷 13, 期 27, 页码 11887-11898

出版社

ROYAL SOC CHEMISTRY
DOI: 10.1039/d1nr02258j

关键词

-

资金

  1. National Computing Infrastructure (NCI) national facility [p00]

向作者/读者索取更多资源

This study compared supervised and unsupervised learning results on gold nanoparticles, finding that model selection is unaffected by descriptors but sensitive to particle description. Atomistic descriptors offered a more detailed interpretation and outperformed crystallographic descriptors, even in challenging multi-classification scenarios.
Machine learning models are known to be sensitive to the features used to train them, but there is currently no way to predict the impact of using different features prior to feature extraction. This is particularly important to fields such as nanotechnology that are highly multi-disciplinary, and samples can be characterised many different ways depending on the preferences of individual researchers. Does it matter if nanomaterials are described using the interatomic coordinations or more complex order parameters? In this study we compare results of supervised and unsupervised learning on a single set of gold nanoparticles that has been characterised by two different descriptors, each with a unique feature space. We find that there are some consistencies, and model selection is descriptor-agnostic, but the level of detail and the type of information that can be extracted from the results is sensitive to the way the particles are described. Unsupervised clustering revealed that an atomistic descriptor provides a finer-grained interpretation and clusters that are sub-clusters of a more sophisticated crystallographic descriptor, which is consistent with both how the features were calculated, and how they are interpreted in the domain. A supervised classifier revealed that the types of features responsible for the separation are related to the bulk structure, regardless of the descriptor, but capture different types of information. For both the atomistic and crystallographic descriptor the gradient boosting decision tree classifier gave superior results of F1-scores of 0.96 and 0.98, respectively, with excellent precision and recall, even though the clustering presented a challenging multi-classification problem.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据