4.7 Article

Clusterdv: a simple density-based clustering method that is robust, general and automatic

Journal

BIOINFORMATICS
Volume 35, Issue 12, Pages 2125-2132

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bty932

Keywords

-

Funding

  1. Portuguese Fundacao para a Ciencia e Tecnologia (FCT)
  2. Bial Foundation [185/12]
  3. Marie Curie [FP7-PEOPLE-2011-CIG]
  4. FCT [PTDC/NEU-NMC/1276/2012]
  5. European Research Council [ERC-2017-COG-773012]
  6. Fundação para a Ciência e a Tecnologia [PTDC/NEU-NMC/1276/2012] Funding Source: FCT

Ask authors/readers for more resources

Motivation How to partition a dataset into a set of distinct clusters is a ubiquitous and challenging problem. The fact that data vary widely in features such as cluster shape, cluster number, density distribution, background noise, outliers and degree of overlap, makes it difficult to find a single algorithm that can be broadly applied. One recent method, clusterdp, based on search of density peaks, can be applied successfully to cluster many kinds of data, but it is not fully automatic, and fails on some simple data distributions. Results We propose an alternative approach, clusterdv, which estimates density dips between points, and allows robust determination of cluster number and distribution across a wide range of data, without any manual parameter adjustment. We show that this method is able to solve a range of synthetic and experimental datasets, where the underlying structure is known, and identifies consistent and meaningful clusters in new behavioral data. Availability and implementation The clusterdv is implemented in Matlab. Its source code, together with example datasets are available on: https://github.com/jcbmarques/clusterdv. Supplementary information Supplementary data are available at Bioinformatics online.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available