4.8 Article

A novel machine learning framework for automated biomedical relation extraction from large-scale literature repositories

期刊

NATURE MACHINE INTELLIGENCE
卷 2, 期 6, 页码 347-+

出版社

NATURE PORTFOLIO
DOI: 10.1038/s42256-020-0189-y

关键词

-

资金

  1. National Natural Science Foundation of China [61872216, 81630103, 31900862]
  2. Turing AI Institute of Nanjing
  3. Zhongguancun Haihua Institute for Frontier Information Technology

向作者/读者索取更多资源

A lot of scientific literature is unstructured, which makes extracting information for biomedical databases difficult. Hong and colleagues show that a distant supervision approach, using latent tree learning and recurrent units, can extract drug-target interactions from literature that were previously unknown. Knowledge about the relations between biomedical entities (such as drugs and targets) is widely distributed in more than 30 million research articles and consistently plays an important role in the development of biomedical science. In this work, we propose a novel machine learning framework, named BERE, for automatically extracting biomedical relations from large-scale literature repositories. BERE uses a hybrid encoding network to better represent each sentence from both semantic and syntactic aspects, and employs a feature aggregation network to make predictions after considering all relevant statements. More importantly, BERE can also be trained without any human annotation via a distant supervision technique. Through extensive tests, BERE has demonstrated promising performance in extracting biomedical relations, and can also find meaningful relations that were not reported in existing databases, thus providing useful hints to guide wet-lab experiments and advance the biological knowledge discovery process.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据