☆ 4.6 Article

Word sense induction using word embeddings and community detection in complex networks

PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS (2019)

期刊

PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS

卷 523, 期 -, 页码 180-190

出版社

ELSEVIER

DOI: 10.1016/j.physa.2019.02.032

关键词

Word sense induction; Language networks; Complex networks; Word embeddings; Community detection; Word sense disambiguation; Semantic networks

类别

Physics, Multidisciplinary

资金

Google USA (Research Awards in Latin America grant)
CAPES-Brazil
Sao Paulo Research Foundation (FAPESP) Brazil [2014/20830-0, 2016/19069-9, 2017/13464-6]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Word Sense Induction (WSI) is the ability to automatically induce word senses from corpora. The WSI task was first proposed to overcome the limitations of manually annotated corpus that are required in word sense disambiguation systems. Even though several works have been proposed to induce word senses, existing systems are still very limited in the sense that they make use of structured, domain-specific knowledge sources. In this paper, we devise a method that leverages recent findings in word embeddings research to generate context embeddings, which are embeddings containing information about the semantical context of a word. In order to induce senses, we modeled the set of ambiguous words as a complex network. In the generated network, two instances (nodes) are connected if the respective context embeddings are similar. Upon using well-established community detection methods to cluster the obtained context embeddings, we found that the proposed method yields excellent performance for the WSI task. Our method outperformed competing algorithms and baselines, in a completely unsupervised manner and without the need of any additional structured knowledge source. (C) 2019 Elsevier B.V. All rights reserved.

Word sense induction using word embeddings and community detection in complex networks

期刊

PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Word sense induction using word embeddings and community detection in complex networks

期刊

PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文