4.6 Article

Similarity Calculation via Passage-Level Event Connection Graph

期刊

APPLIED SCIENCES-BASEL
卷 12, 期 19, 页码 -

出版社

MDPI
DOI: 10.3390/app12199887

关键词

text similarity calculation; passage-level event connection graph; vector tuning; graph embedding

资金

  1. National Key Research and Development Project [2021YFF0901600]
  2. Project of State Key Laboratory of Communication Content Cognition [A02101]
  3. National Science Foundation of China [61976073, 62276083]
  4. Shenzhen Foundational Research Funding [JCYJ20200109113441941]

向作者/读者索取更多资源

This paper addresses the challenge of measuring text similarity in web applications and proposes a passage-level event connection graph to model the relationships between events mentioned in the text. The core event is revealed from the graph and used to measure text similarity. Two improvements are also provided to better model the relationships between events. Experimental results demonstrate that our calculation outperforms unsupervised methods and achieves comparable results to some supervised neuron-based methods in measuring text similarity.
Recently, many information processing applications appear on the web on the demand of user requirement. Since text is one of the most popular data formats across the web, how to measure text similarity becomes the key challenge to many web applications. Web text is often used to record events, especially for news. One text often mentions multiple events, while only the core event decides its main topic. This core event should take the important position when measuring text similarity. For this reason, this paper constructs a passage-level event connection graph to model the relations among events mentioned in one text. This graph is composed of many subgraphs formed by triggers and arguments extracted sentence by sentence. The subgraphs are connected via the overlapping arguments. In term of centrality measurement, the core event can be revealed from the graph and utilized to measure text similarity. Moreover, two improvements based on vector tunning are provided to better model the relations among events. One is to find the triggers which are semantically similar. By linking them in the event connection graph, the graph can cover the relations among events more comprehensively. The other is to apply graph embedding to integrate the global information carried by the entire event connection graph into the core event to let text similarity be partially guided by the full-text content. As shown by experimental results, after measuring text similarity from a passage-level event representation perspective, our calculation acquires superior results than unsupervised methods and even comparable results with some supervised neuron-based methods. In addition, our calculation is unsupervised and can be applied in many domains free from the preparation of training data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据