4.7 Article Data Paper

Data Descriptor: Auto-generated materials database of Curie and Neel temperatures via semi-supervised relationship extraction

期刊

SCIENTIFIC DATA
卷 5, 期 -, 页码 -

出版社

NATURE PUBLISHING GROUP
DOI: 10.1038/sdata.2018.111

关键词

-

资金

  1. EPSRC Computational Methods in Materials Science Centre [EP/L015552/1]
  2. Royal Commission of the 1851 Great Exhibition
  3. DOE Office of Science, Office of Basic Energy Sciences [DE-AC02-06CH11357]

向作者/读者索取更多资源

Large auto-generated databases of magnetic materials properties have the potential for great utility in materials science research. This article presents an auto-generated database of 39,822 records containing chemical compounds and their associated Curie and Neel magnetic phase transition temperatures. The database was produced using natural language processing and semi-supervised quaternary relationship extraction, applied to a corpus of 68,078 chemistry and physics articles. Evaluation of the database shows an estimated overall precision of 73%. Therein, records processed with the text-mining toolkit, ChemDataExtractor, were assisted by a modified Snowball algorithm, whose original binary relationship extraction capabilities were extended to quaternary relationship extraction. Consequently, its machine learning component can now train with <= 500 seeds, rather than the 4,000 originally used. Data processed with the modified Snowball algorithm affords 82% precision. Database records are available in MongoDB, CSV and JSON formats which can easily be read using Python, R, Java and MatLab. This makes the database easy to query for tackling big-data materials science initiatives and provides a basis for magnetic materials discovery. [GRAPHICS] .

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据