4.7 Article

An approach for measuring semantic similarity between Wikipedia concepts using multiple inheritances

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 57, Issue 3, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2019.102188

Keywords

Semantic similarity; Multiple inheritance; Information content; Wikipedia category graph; Knowledge graph

Funding

  1. National Natural Science Foundation of China [61772210, U1911201]
  2. Guangdong Province Universities Pearl River Scholar Funded Scheme (2018)
  3. Project of Science and Technology in Guangzhou in China [201807010043]

Ask authors/readers for more resources

Wikipedia provides a huge collaboratively made semi-structured taxonomy called Wikipedia category graph (WCG), which can be utilized as a Knowledge Graph (KG) to measure the semantic similarity (SS) between Wikipedia concepts. Previously, several Most Informative Common Ancestor-based (MICA-based) SS methods have been proposed by intrinsically manipulating the taxonomic structure of WCG. However, some basic structural issues in WCG such as huge size, branching factor and multiple inheritance relations hamper the applicability of traditional MICA-based and multiple inheritance-based approaches in it. Therefore, in this paper, we propose a solution to handle these structural issues and present a new multiple inheritance-based SS approach, called Neighborhood Ancestor Semantic Contribution (NASC). In this approach, firstly, we define the neighborhood of a category (a taxonomic concept in WCG) to define its semantic space. Secondly, we describe the semantic value of a category by aggregating the intrinsic IC-based semantic contribution weights of its semantically relevant multiple ancestors. Thirdly, based on our approach, we propose six different methods to compute the SS between Wikipedia concepts. Finally, we evaluate our methods on gold standard word similarity benchmarks for English, German, Spanish and French languages. The experimental evaluation demonstrates that the proposed NASC-based methods remarkably outperform traditional MICA-based and multiple inheritance-based approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available