期刊
COMPUTATIONAL MATERIALS SCIENCE
卷 171, 期 -, 页码 -出版社
ELSEVIER
DOI: 10.1016/j.commatsci.2019.109203
关键词
Materials discovery; Machine learning; Cross-validation; Extrapolation; Exploration; Interpolation; Performance evaluation
资金
- NSF
- SC EPSCoR/IDeA Program [OIA-1655740, 19-GC02]
- [1905775]
The materials discovery problem usually aims to identify novel outlier materials with extremely low or high property values outside of the scope of all known materials. It can be mapped as an explorative prediction problem. However, currently the performance of machine learning algorithms for materials property prediction is usually evaluated via k-fold cross-validation (CV) or holdout-test, which tend to over-estimate their explorative prediction performance in discovering novel materials. We propose k-fold-m-step forward cross-validation (kmFCV) as a new way for evaluating exploration performance in materials property prediction and conducted a comprehensive benchmark evaluation on the exploration performance of a variety of prediction models on materials property (including formation energy, band gap, and superconducting critical temperature) prediction with different materials representation and machine learning algorithms. Our results show that even though current machine learning models can achieve good results when evaluated with traditional CV, their explorative power is actually very low as shown by our proposed kmFCV evaluation method and the proposed exploration accuracy. More advanced explorative machine learning algorithms are strongly needed for new materials discovery.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据