4.7 Review

Opportunities and challenges of text mining in materials research

期刊

ISCIENCE
卷 24, 期 3, 页码 -

出版社

CELL PRESS
DOI: 10.1016/j.isci.2021.102155

关键词

-

资金

  1. National Science Foundation [1922311, 1922372, 1922090]
  2. Office of Naval Research (ONR) [N00014-16-1-2432]
  3. U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Materials Sciences and Engineering Division [DE-AC02-05-CH11231]
  4. Assistant Secretary of Energy Efficiency and Renewable Energy, Vehicle Technologies Office, U.S. Department of Energy [DE-AC02-05CH11231]
  5. Energy Biosciences Institute through the EBI-Shell program [PT74140, PT78473]
  6. Direct For Mathematical & Physical Scien [1922372] Funding Source: National Science Foundation
  7. Direct For Mathematical & Physical Scien
  8. Division Of Materials Research [1922311, 1922090] Funding Source: National Science Foundation
  9. Division Of Materials Research [1922372] Funding Source: National Science Foundation

向作者/读者索取更多资源

Research publications serve as the primary repository of scientific knowledge, but their unstructured format poses obstacles to large-scale analysis. Recent advances in natural language processing have provided tools for information extraction, but challenges arise when applied to scientific text with technical terminology. Text mining methodology in materials science is still in its early stages, with efforts focused on understanding the application of TM and NLP in this field.
Research publications are the major repository of scientific knowledge. However, their unstructured and highly heterogenous format creates a significant obstacle to large-scale analysis of the information contained within. Recent progress in natural language processing (NLP) has provided a variety of tools for high-quality information extraction from unstructured text. These tools are primarily trained on non-technical text and struggle to produce accurate results when applied to scientific text, involving specific technical terminology. During the last years, significant efforts in information retrieval have been made for biomedical and biochemical publications. For materials science, text mining (TM) methodology is still at the dawn of its development. In this review, we survey the recent progress in creating and applying TM and NLP approaches to materials science field. This review is directed at the broad class of researchers aiming to learn the fundamentals of TM as applied to the materials science publications.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据