☆ 4.6 Article

Learning domain ontologies from document warehouses and dedicated web sites

COMPUTATIONAL LINGUISTICS (2004)

Journal

COMPUTATIONAL LINGUISTICS

Volume 30, Issue 2, Pages 151-179

Publisher

M I T PRESS

DOI: 10.1162/089120104323093276

Keywords

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

We present a method and a tool, OntoLearn, aimed at the extraction of domain ontologies from Web sites, and more generally from documents shared among the members of virtual organizations. OntoLearn first extracts a domain terminology from available documents. Then, complex domain terms are semantically interpreted and arranged in a hierarchical fashion. Finally, a general-purpose ontology, WordNet, is trimmed and enriched with the detected domain concepts. The major novel aspect of this approach is semantic interpretation, that is, the association of a complex concept with a complex term. This involves finding the appropriate WordNet concept for each word of a terminological string and the appropriate conceptual relations that hold among the concept components. Semantic interpretation is based on a new word sense disambiguation algorithm, called structural semantic interconnections.

Learning domain ontologies from document warehouses and dedicated web sites

Journal

COMPUTATIONAL LINGUISTICS

Publisher

M I T PRESS

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Learning domain ontologies from document warehouses and dedicated web sites

Journal

COMPUTATIONAL LINGUISTICS

Publisher

M I T PRESS

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper