4.0 Article Proceedings Paper

Ultrafast clustering of single-cell flow cytometry data using FlowGrid

期刊

BMC SYSTEMS BIOLOGY
卷 13, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s12918-019-0690-2

关键词

Clustering; Flow cytometry; Single cell; DBSCAN

资金

  1. New South Wales Ministry of Health
  2. National Heart Foundation Future Leader Fellowship [100848]
  3. Victor Chang Cardiac Research Institute
  4. National Health and Medical Research Council Career Development Fellowship [1105271]

向作者/读者索取更多资源

BackgroundFlow cytometry is a popular technology for quantitative single-cell profiling of cell surface markers. It enables expression measurement of tens of cell surface protein markers in millions of single cells. It is a powerful tool for discovering cell sub-populations and quantifying cell population heterogeneity. Traditionally, scientists use manual gating to identify cell types, but the process is subjective and is not effective for large multidimensional data. Many clustering algorithms have been developed to analyse these data but most of them are not scalable to very large data sets with more than ten million cells.ResultsHere, we present a new clustering algorithm that combines the advantages of density-based clustering algorithm DBSCAN with the scalability of grid-based clustering. This new clustering algorithm is implemented in python as an open source package, FlowGrid. FlowGrid is memory efficient and scales linearly with respect to the number of cells. We have evaluated the performance of FlowGrid against other state-of-the-art clustering programs and found that FlowGrid produces similar clustering results but with substantially less time. For example, FlowGrid is able to complete a clustering task on a data set of 23.6 million cells in less than 12 seconds, while other algorithms take more than 500 seconds or get into error.ConclusionsFlowGrid is an ultrafast clustering algorithm for large single-cell flow cytometry data. The source code is available at https://github.com/VCCRI/FlowGrid.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据