4.7 Article

Knowledge extraction from textual data and performance evaluation in an unsupervised context

Journal

INFORMATION SCIENCES
Volume 629, Issue -, Pages 324-343

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2023.01.150

Keywords

Natural Language Processing; Performance measure; Ontological relation extraction; Knowledge base; Automated validation

This article proposes a method for measuring performance in an unsupervised knowledge extraction context. It also presents an unsupervised rule-based approach for domain-independent ontology population and knowledge extraction from textual data.
Among the emerging challenges in monitoring systems, the aggregation, synthesis and management of knowledge through ontological structures hold an essential place. Existing knowledge extraction systems often use a supervised approach that relies on annotated data, which implies a tedious annotation process. Current research is moving toward unsupervised or semi-supervised systems, which allow a wider range of knowledge extraction. Evaluating such systems, which perform knowledge extraction using natural language processing methods, requires performance indicators. The indicators usually used in such evaluations have limitations in the specific context of knowledge extraction for unsupervised ontology population. New evaluation methods are therefore needed because of the singular nature of the harvested data, especially when they are not annotated. Hence, this article proposes a method for measuring performance in an unsupervised context where reference data and extracted data do not overlap optimally. The proposed evaluation method exploits data that serve as a reference but are not specifically linked to the data used for extraction, which makes it an original evaluation method. To apply the performance measure to concrete cases, this paper also presents an unsupervised, self-feeding, rule-based approach for domain-independent ontology population from textual data.
