4.3 Article

Recovery guarantees for exemplar-based clustering

Journal

INFORMATION AND COMPUTATION
Volume 245, Issue -, Pages 165-180

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ic.2015.09.002

Keywords

Exact recovery; k-Medoids; Linear programming; Separated balls

Funding

  1. Alfred P. Sloan Foundation
  2. ONR [N00014-12-1-0743]
  3. NSF CAREER Award
  4. AFOSR Young Investigator Program Award
  5. National Institutes of Health [R01CA163336]

Ask authors/readers for more resources

For a certain class of distributions, we prove that the linear programming relaxation of k-medoids clustering - a variant of k-means clustering where means are replaced by exemplars from within the dataset-distinguishes points drawn from nonoverlapping balls with high probability once the number of points drawn and the separation distance between any two balls are sufficiently large. Our results hold in the nontrivial regime where the separation distance is small enough that points drawn from different balls may be closer to each other than points drawn from the same ball; in this case, clustering by thresholding pairwise distances between points can fail. We also exhibit numerical evidence of high-probability recovery in a substantially more permissive regime. (C) 2015 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available