4.5 Article

Focused crawling of tagged web resources using ontology

Journal

COMPUTERS & ELECTRICAL ENGINEERING
Volume 39, Issue 2, Pages 613-628

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compeleceng.2012.09.009

Keywords

-

Ask authors/readers for more resources

Scrutinizing web resources of interest from a large number of search results is a tedious task for any web user. Fortunately, social sites such as Social Bookmarking Site (SBS) allow web users to store their preferences and searched results of their interest in the form of bookmarks. Such sites however contain lots of irrelevant data as noise and, predicting relevant URLs from the noise is a real challenge. With intent to overcome the challenge, this paper proposes a focused crawler, FCHC that mimics a human cognitive search pattern to find potentially relevant web resources from a SBS. The focused crawler utilizes domain specific Concept Ontology to semantically expand a search topic and to determine Semantic Relevance of tags. The crawler is tested with different search patterns on the 'database' domain and evaluated using a well established metric, harvest ratio. The performance of FCHC is analyzed and compared with focused crawlers that crawl the WWW using ontology and, without ontology. (C) 2012 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available