4.7 Article

A formal derivation of Heaps' law

Journal

INFORMATION SCIENCES
Volume 170, Issue 2-4, Pages 263-272

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2004.03.006


Word frequencies in text documents are reasonably well described by the Mandelbrot distribution, which has Zipf's law as a special case. The growth of vocabulary size as a function of text size (the number of words) is described by Heaps' law. These two empirical laws have been shown to be related. In this paper we go a step further and provide a formal derivation of Heaps' law from the Mandelbrot distribution. We also specify the range of validity within which Heaps' law applies. © 2004 Elsevier Inc. All rights reserved.
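The relationship described in the abstract can be explored numerically. The sketch below is an illustration only, not the paper's derivation. It assumes the common textbook forms of the two laws: rank-frequency weights proportional to 1/(r + b)^a for the Mandelbrot distribution (Zipf's law being the case b = 0), and V(n) = K n^beta with 0 < beta < 1 for Heaps' law. The function names, the parameter values (a = 1.2, b = 2.7, 50,000 word types) and the fitting range are all hypothetical choices, and the paper's own notation may differ.

```python
import math
import random
from itertools import accumulate


def mandelbrot_weights(n_types, a=1.2, b=2.7):
    """Unnormalised Mandelbrot weights 1 / (r + b)**a for ranks r = 1..n_types.

    The parameter values are illustrative assumptions, not taken from the paper.
    """
    return [1.0 / (r + b) ** a for r in range(1, n_types + 1)]


def vocabulary_growth(n_tokens, n_types=50_000, a=1.2, b=2.7, seed=0):
    """Sample a synthetic text and return V(n), the number of distinct
    word types seen after n tokens, for n = 1..n_tokens."""
    rng = random.Random(seed)
    cum = list(accumulate(mandelbrot_weights(n_types, a, b)))
    tokens = rng.choices(range(n_types), cum_weights=cum, k=n_tokens)
    seen = set()
    growth = []
    for token in tokens:
        seen.add(token)
        growth.append(len(seen))
    return growth


def fit_heaps(growth, start=100):
    """Least-squares fit of log V = log K + beta * log n for n >= start.

    Returns the pair (K, beta) of the assumed Heaps form V(n) = K * n**beta.
    """
    ns = range(start, len(growth) + 1)
    xs = [math.log(n) for n in ns]
    ys = [math.log(growth[n - 1]) for n in ns]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    beta = sxy / sxx
    k = math.exp(mean_y - beta * mean_x)
    return k, beta


if __name__ == "__main__":
    growth = vocabulary_growth(20_000)
    k, beta = fit_heaps(growth)
    print(f"fitted Heaps parameters: K = {k:.2f}, beta = {beta:.3f}")
```

Because the synthetic vocabulary is finite, the fitted exponent depends on the sample length and on the fitting range, so it should be read as a rough empirical check rather than as the value a formal derivation would give.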
