Article; Proceedings Paper

Comparing clusterings by the variation of information

Journal

LEARNING THEORY AND KERNEL MACHINES
Volume 2777, Pages 173-187

Publisher

SPRINGER-VERLAG BERLIN
DOI: 10.1007/978-3-540-45167-9_14

Keywords

clustering; comparing partitions; measures of agreement; information theory; mutual information

This paper proposes an information-theoretic criterion for comparing two partitions, or clusterings, of the same data set. The criterion, called variation of information (VI), measures the amount of information lost and gained in changing from clustering C to clustering C'. The criterion makes no assumptions about how the clusterings were generated and applies to both soft and hard clusterings. The basic properties of VI are presented and discussed from the point of view of comparing clusterings. In particular, the VI is positive, symmetric and obeys the triangle inequality. Thus, surprisingly enough, it is a true metric on the space of clusterings.
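
The abstract does not state the formula, so the following is only a minimal sketch for hard clusterings. It assumes the commonly used definition VI(C, C') = H(C|C') + H(C'|C) = 2 H(C, C') - H(C) - H(C'), with entropies in nats. The function name `variation_of_information` and the label-list input format are illustrative choices, not taken from the paper.

```python
import math
from collections import Counter


def variation_of_information(labels_a, labels_b):
    """Sketch of VI between two hard clusterings of the same data set.

    Each argument is a sequence of cluster labels, one per data point,
    in the same point order. Assumes VI = 2*H(A,B) - H(A) - H(B).
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both clusterings must label the same points")
    n = len(labels_a)
    if n == 0:
        raise ValueError("clusterings must contain at least one point")

    def entropy(counts):
        # Shannon entropy (natural log) of the empirical distribution.
        return -sum((c / n) * math.log(c / n) for c in counts.values())

    h_a = entropy(Counter(labels_a))                  # H(A)
    h_b = entropy(Counter(labels_b))                  # H(B)
    h_ab = entropy(Counter(zip(labels_a, labels_b)))  # joint entropy H(A,B)

    # H(A|B) + H(B|A) = 2*H(A,B) - H(A) - H(B); clamp tiny negative
    # values that can arise from floating-point rounding.
    return max(0.0, 2.0 * h_ab - h_a - h_b)
```

Under this definition, two clusterings that differ only in how the clusters are named give a VI of zero, and swapping the two arguments does not change the result, which matches the symmetry stated in the abstract.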
