☆ 4.7 Article

Synthetic data sets for the identification of key ingredients for RNA-seq differential analysis

BRIEFINGS IN BIOINFORMATICS (2018)

期刊

BRIEFINGS IN BIOINFORMATICS

卷 19, 期 1, 页码 65-76

出版社

OXFORD UNIV PRESS

DOI: 10.1093/bib/bbw092

关键词

RNA-seq; differential analysis; benchmark data set

类别

Biochemical Research Methods Mathematical & Computational Biology

资金

French Agence Nationale de la Recherche (project MixStatSeq) [ANR-13-JS01-0001-01]
French Institut National de la Recherche Agronomique AIP bioressource project
GIS Infrastructures en Biologie Sante et Agronomie AO Platform call
LabEx Saclay Plant Sciences-SPS [ANR-10-LABX-0040-SPS]
Agence Nationale de la Recherche (ANR) [ANR-13-JS01-0001] Funding Source: Agence Nationale de la Recherche (ANR)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Numerous statistical pipelines are now available for the differential analysis of gene expression measured with RNA-sequencing technology. Most of them are based on similar statistical frameworks after normalization, differing primarily in the choice of data distribution, mean and variance estimation strategy and data filtering. We propose an evaluation of the impact of these choices when few biological replicates are available through the use of synthetic data sets. This framework is based on real data sets and allows the exploration of various scenarios differing in the proportion of non-differentially expressed genes. Hence, it provides an evaluation of the key ingredients of the differential analysis, free of the biases associated with the simulation of data using parametric models. Our results show the relevance of a proper modeling of the mean by using linear or generalized linear modeling. Once the mean is properly modeled, the impact of the other parameters on the performance of the test is much less important. Finally, we propose to use the simple visualization of the raw P-value histogram as a practical evaluation criterion of the performance of differential analysis methods on real data sets.

Synthetic data sets for the identification of key ingredients for RNA-seq differential analysis

期刊

BRIEFINGS IN BIOINFORMATICS

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Synthetic data sets for the identification of key ingredients for RNA-seq differential analysis

期刊

BRIEFINGS IN BIOINFORMATICS

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文