4.5 Article

Evaluating explorative prediction power of machine learning algorithms for materials discovery using k-fold forward cross-validation

期刊

COMPUTATIONAL MATERIALS SCIENCE
卷 171, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.commatsci.2019.109203

关键词

Materials discovery; Machine learning; Cross-validation; Extrapolation; Exploration; Interpolation; Performance evaluation

资金

  1. NSF
  2. SC EPSCoR/IDeA Program [OIA-1655740, 19-GC02]
  3. [1905775]

向作者/读者索取更多资源

The materials discovery problem usually aims to identify novel outlier materials with extremely low or high property values outside of the scope of all known materials. It can be mapped as an explorative prediction problem. However, currently the performance of machine learning algorithms for materials property prediction is usually evaluated via k-fold cross-validation (CV) or holdout-test, which tend to over-estimate their explorative prediction performance in discovering novel materials. We propose k-fold-m-step forward cross-validation (kmFCV) as a new way for evaluating exploration performance in materials property prediction and conducted a comprehensive benchmark evaluation on the exploration performance of a variety of prediction models on materials property (including formation energy, band gap, and superconducting critical temperature) prediction with different materials representation and machine learning algorithms. Our results show that even though current machine learning models can achieve good results when evaluated with traditional CV, their explorative power is actually very low as shown by our proposed kmFCV evaluation method and the proposed exploration accuracy. More advanced explorative machine learning algorithms are strongly needed for new materials discovery.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据