4.5 Article

Enhancing semantic text similarity with functional semantic knowledge (FOP) in patents

期刊

JOURNAL OF INFORMETRICS
卷 18, 期 1, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.joi.2023.101467

关键词

Semantic text similarity (STS); Subject -action -object (SAO); Functional semantic knowledge (FOP); Bi-LSTM; Patent similarity; Pre -trained embedding

向作者/读者索取更多资源

In this paper, a new method based on functional semantic knowledge (FOP) is proposed for patent similarity calculation. Furthermore, patent STS datasets are processed and released as benchmarks. Preliminary results show that FOP-based methods are more suitable for STS tasks when combined with IPC codes, weights' assignments, and patent pre-trained vectors.
The semantic text similarity (STS) estimation between patents is a critical issue for the patent portfolio analysis. Current methods such as keywords, co-word analysis and even the SubjectAction-Object (SAO) algorithms, are not quite reasonable for the patent similarity calculation due to the lack of fine-grained semantic knowledge, property-parameter features and flexible functional or non-functional combinations. In the meanwhile, standardized similarity datasets are also unavailable. In this paper, we have proposed a new kind of functional semantic knowledge (Function-Object-Property, i.e., FOP) instead of SAO triples, which can contribute directly to enhance the patent similarity. Moreover, patent STS datasets, including the matching dataset and the ranking dataset, have firstly been processed and released as benchmarks for the comparative evaluation. Preliminary results have demonstrated that FOP-based methods are more appropriate in the STS tasks incorporated with IPC codes, weights' assignments and patent pre-trained vectors. To be further, the deep interaction-based models with the averaged FOP embeddings are recommended to be one of the most optimal choices of effectively improving the semantic learning capability. Finally, a new patent similarity calculation framework is summarized and successfully applied in the patent retrieval, which highlight that the proposed methodology serves as a dominant power in diverse patented STS tasks.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据