☆ 4.6 Article

Benchmark and Parameter Sensitivity Analysis of Single-Cell RNA Sequencing Clustering Methods

FRONTIERS IN GENETICS (2019)

期刊

FRONTIERS IN GENETICS

卷 10, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA

DOI: 10.3389/fgene.2019.01253

关键词

single-cell RNA-seq; clustering methods; benchmark; parameter sensitivity analysis; high-dimensional data analysis

类别

Genetics & Heredity

资金

INCIPIT PhD program - COFUND scheme (Marie Sklodowska-Curie Actions) grant [665403]
EPIGEN project
ADViSE project
Marie Curie Actions (MSCA) [665403] Funding Source: Marie Curie Actions (MSCA)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Single-cell RNA-seq (scRNAseq) is a powerful tool to study heterogeneity of cells. Recently, several clustering based methods have been proposed to identify distinct cell populations. These methods are based on different statistical models and usually require to perform several additional steps, such as preprocessing or dimension reduction, before applying the clustering algorithm. Individual steps are often controlled by method-specific parameters, permitting the method to be used in different modes on the same datasets, depending on the user choices. The large number of possibilities that these methods provide can intimidate non-expert users, since the available choices are not always clearly documented. In addition, to date, no large studies have invistigated the role and the impact that these choices can have in different experimental contexts. This work aims to provide new insights into the advantages and drawbacks of scRNAseq clustering methods and describe the ranges of possibilities that are offered to users. In particular, we provide an extensive evaluation of several methods with respect to different modes of usage and parameter settings by applying them to real and simulated datasets that vary in terms of dimensionality, number of cell populations or levels of noise. Remarkably, the results presented here show that great variability in the performance of the models is strongly attributed to the choice of the user-specific parameter settings. We describe several tendencies in the performance attributed to their modes of usage and different types of datasets, and identify which methods are strongly affected by data dimensionality in terms of computational time. Finally, we highlight some open challenges in scRNAseq data clustering, such as those related to the identification of the number of clusters.

Benchmark and Parameter Sensitivity Analysis of Single-Cell RNA Sequencing Clustering Methods

期刊

FRONTIERS IN GENETICS

出版社

FRONTIERS MEDIA SA

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Benchmark and Parameter Sensitivity Analysis of Single-Cell RNA Sequencing Clustering Methods

期刊

FRONTIERS IN GENETICS

出版社

FRONTIERS MEDIA SA

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文