Article

Keeping up with the changing Web

Journal

COMPUTER
Volume 33, Issue 5, Pages 52-+

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/2.841784

Because information depreciates over time, keeping Web pages current presents new design challenges. This article quantifies what "current" means for Web search engines and estimates how often they must reindex the Web to keep current with its changing pages and structure. Most information, from a newspaper story to a temperature sensor measurement to a Web page, is dynamic. When monitoring an information source, when do our previous observations become stale and need refreshing? How can we schedule these refresh operations to satisfy a required level of currency without violating resource constraints, such as bandwidth or computing limitations on how much data can be observed in a given time? The authors investigate the trade-offs involved in monitoring dynamic information sources and discuss the Web in detail, estimating how fast documents change and exploring what constitutes a current Web index. For a simple class of Web-monitoring systems, search engines, they combine their idea of currency with actual measured data to estimate revisit rates.
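
The trade-off between currency and revisit rate can be illustrated with a small sketch. It is not the authors' method: it assumes, purely for illustration, that a page changes as a Poisson process with a known rate and that a crawler revisits it at a fixed interval. Under that assumption the time-averaged probability that the stored copy is still current is (1 - e^(-x)) / x, where x is the change rate times the revisit interval. The function names and the bisection search are hypothetical choices made for this example.

```python
import math


def expected_freshness(change_rate: float, revisit_interval: float) -> float:
    """Time-averaged probability that a stored copy still matches the page.

    Assumes changes arrive as a Poisson process (an assumption of this
    sketch, not a claim about the article's model).
    """
    x = change_rate * revisit_interval
    if x == 0:
        return 1.0
    # (1 - e^{-x}) / x, written with expm1 for accuracy at small x
    return -math.expm1(-x) / x


def max_revisit_interval(change_rate: float, target: float) -> float:
    """Longest revisit interval whose expected freshness is still >= target."""
    if change_rate <= 0 or not 0 < target < 1:
        raise ValueError("need change_rate > 0 and 0 < target < 1")
    # Freshness falls as the interval grows; at interval 1/(rate*target)
    # it is already below the target, so this bracket contains the answer.
    low, high = 0.0, 1.0 / (change_rate * target)
    for _ in range(100):  # bisection
        mid = (low + high) / 2
        if expected_freshness(change_rate, mid) >= target:
            low = mid
        else:
            high = mid
    return low


if __name__ == "__main__":
    # Hypothetical page that changes on average once every 10 days.
    interval = max_revisit_interval(change_rate=0.1, target=0.95)
    print(f"revisit at most every {interval:.3f} days")
```

Because freshness in this model depends only on the product of change rate and interval, doubling the change rate halves the allowable interval, which is the kind of bandwidth pressure the abstract describes.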
