☆ 4.8 Article

Context-specific interaction networks from vector representation of words

NATURE MACHINE INTELLIGENCE (2019)

期刊

NATURE MACHINE INTELLIGENCE

卷 1, 期 4, 页码 181-190

出版社

NATURE PORTFOLIO

DOI: 10.1038/s42256-019-0036-1

关键词

类别

Computer Science, Artificial Intelligence Computer Science, Interdisciplinary Applications

资金

European Union [668858]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Biomedical publications provide a rich and largely untapped source of knowledge. INtERAcT exploits word embeddings trained on a corpus of cancer-specific articles to estimate molecular interactions. The algorithm is able to reconstruct molecular pathways associated with ten cancer types, even in corpora of limited size. The number of biomedical publications has grown steadily in recent years. However, most biomedical facts are not readily available, but buried in the form of unstructured text. Here we present INtERAcT, an unsupervised method to extract interactions from a corpus of biomedical articles. INtERAcT exploits a vector representation of words, computed on a corpus of domain-specific knowledge, and implements a new metric that estimates an interaction score between two molecules in the space where the corresponding words are embedded. We use INtERAcT to reconstruct the molecular pathways of 10 different cancer types using corpora of disease-specific articles, considering the STRING database as a benchmark. Our metric outperforms currently adopted approaches and it is highly robust to parameter choices, leading to the identification of known molecular interactions in all studied cancer types. Furthermore, our approach does not require text annotation, manual curation or the definition of semantic rules based on expert knowledge, and can therefore be efficiently applied to different scientific domains.

Context-specific interaction networks from vector representation of words

期刊

NATURE MACHINE INTELLIGENCE

出版社

NATURE PORTFOLIO

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Context-specific interaction networks from vector representation of words

期刊

NATURE MACHINE INTELLIGENCE

出版社

NATURE PORTFOLIO

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文