☆ 4.6 Article

Learning domain ontologies from document warehouses and dedicated web sites

COMPUTATIONAL LINGUISTICS (2004)

期刊

COMPUTATIONAL LINGUISTICS

卷 30, 期 2, 页码 151-179

出版社

M I T PRESS

DOI: 10.1162/089120104323093276

关键词

类别

Computer Science, Artificial Intelligence Computer Science, Interdisciplinary Applications Linguistics Language & Linguistics

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

We present a method and a tool, OntoLearn, aimed at the extraction of domain ontologies from Web sites, and more generally from documents shared among the members of virtual organizations. OntoLearn first extracts a domain terminology from available documents. Then, complex domain terms are semantically interpreted and arranged in a hierarchical fashion. Finally, a general-purpose ontology, WordNet, is trimmed and enriched with the detected domain concepts. The major novel aspect of this approach is semantic interpretation, that is, the association of a complex concept with a complex term. This involves finding the appropriate WordNet concept for each word of a terminological string and the appropriate conceptual relations that hold among the concept components. Semantic interpretation is based on a new word sense disambiguation algorithm, called structural semantic interconnections.

Learning domain ontologies from document warehouses and dedicated web sites

期刊

COMPUTATIONAL LINGUISTICS

出版社

M I T PRESS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Learning domain ontologies from document warehouses and dedicated web sites

期刊

COMPUTATIONAL LINGUISTICS

出版社

M I T PRESS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文