Article

Graph-based exploration and clustering analysis of semantic spaces

Journal

APPLIED NETWORK SCIENCE
Volume 4, Issue 1, Pages -

Publisher

SPRINGERNATURE
DOI: 10.1007/s41109-019-0228-y

Keywords

Semantic spaces; Graph theory; Word2vec similarity networks; Cohesive clusters; Cliques; Clique relaxations

Funding

  1. U.S. Air Force Research Laboratory (AFRL) [FA8651-16-2-0009]
  2. U.S. Air Force Research Laboratory (AFRL) European Office of Aerospace Research and Development [FA9550-17-1-0030]


The goal of this study is to demonstrate how network science and graph theory tools and concepts can be used to explore and compare the semantic spaces of word embeddings and lexical databases. Specifically, we construct semantic networks based on word2vec representations of words, learnt from large text corpora (Google News, Amazon reviews), and human-built word networks derived from two well-known lexical databases: WordNet and Moby Thesaurus. We compare global (e.g., degrees, distances, clustering coefficients) and local (e.g., most central nodes and community-type dense clusters) characteristics of the considered networks. Our observations suggest that human-built networks possess more intuitive global connectivity patterns, whereas local characteristics (in particular, dense clusters) of the machine-built networks provide much richer information on the contextual usage and perceived meanings of words. This reveals interesting structural differences between human-built and machine-built semantic networks. To our knowledge, this is the first study that uses graph theory and network science in the considered context; therefore, we also provide interesting examples and discuss potential research directions that may motivate further research on the synthesis of lexicographic and machine-learning-based tools and lead to new insights in this area.
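
The kind of analysis the abstract describes can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the toy three-dimensional vectors stand in for real word2vec embeddings, the cosine-similarity cutoff of 0.9 is arbitrary, and the helper names are hypothetical. The abstract does not state how the authors actually build their networks, so this should not be read as their method. The sketch links words whose vectors are similar enough, then reports one global-style measure (degree), one local measure (clustering coefficient), and the maximal cliques found by a plain Bron-Kerbosch search.

```python
import math
from itertools import combinations

# Toy "embeddings": hypothetical values, NOT real word2vec output.
VECTORS = {
    "cat": (0.9, 0.1, 0.0),
    "dog": (0.8, 0.2, 0.1),
    "pet": (0.85, 0.15, 0.05),
    "car": (0.1, 0.9, 0.2),
    "truck": (0.05, 0.95, 0.25),
    "banana": (0.0, 0.1, 0.95),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def build_similarity_graph(vectors, threshold):
    """Link two words when their cosine similarity reaches the threshold.

    The threshold rule is an assumption for this sketch.
    """
    graph = {word: set() for word in vectors}
    for a, b in combinations(vectors, 2):
        if cosine(vectors[a], vectors[b]) >= threshold:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def local_clustering(graph, node):
    """Fraction of a node's neighbour pairs that are themselves linked."""
    neighbours = graph[node]
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(neighbours, 2) if b in graph[a])
    return 2.0 * links / (k * (k - 1))


def maximal_cliques(graph):
    """Enumerate maximal cliques with basic Bron-Kerbosch (no pivoting)."""
    cliques = []

    def expand(current, candidates, excluded):
        if not candidates and not excluded:
            cliques.append(frozenset(current))
            return
        for v in list(candidates):
            expand(current | {v}, candidates & graph[v], excluded & graph[v])
            candidates = candidates - {v}
            excluded = excluded | {v}

    expand(set(), set(graph), set())
    return cliques


if __name__ == "__main__":
    graph = build_similarity_graph(VECTORS, threshold=0.9)
    for word in sorted(graph):
        print(word, "degree:", len(graph[word]),
              "clustering:", local_clustering(graph, word))
    for clique in maximal_cliques(graph):
        print("clique:", sorted(clique))
```

On real embeddings the vocabulary is far larger, so pairwise comparison and unpivoted clique enumeration as written here would be too slow; a graph library and an approximate nearest-neighbour index would be the more practical choice.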
