期刊
NATURE METHODS
卷 10, 期 12, 页码 1185-+出版社
NATURE PORTFOLIO
DOI: 10.1038/NMETH.2722
关键词
-
资金
- European Molecular Biology Laboratory
- US National Institutes of Health/NHGRI [U54HG004555, U54HG004557]
- Wellcome Trust [WT09805]
- Ministerio de Educacion y Ciencia [BIO2011-26205, CSD2007-00050]
- Direct For Biological Sciences
- Emerging Frontiers [0850237] Funding Source: National Science Foundation
- Direct For Computer & Info Scie & Enginr [1054631] Funding Source: National Science Foundation
- Div Of Information & Intelligent Systems [1054631] Funding Source: National Science Foundation
High-throughput RNA sequencing is an increasingly accessible method for studying gene structure and activity on a genome-wide scale. A critical step in RNA-seq data analysis is the alignment of partial transcript reads to a reference genome sequence. To assess the performance of current mapping software, we invited developers of RNA-seq aligners to process four large human and mouse RNA-seq data sets. In total, we compared 26 mapping protocols based on 11 programs and pipelines and found major performance differences between methods on numerous benchmarks, including alignment yield, basewise accuracy, mismatch and gap placement, exon junction discovery and suitability of alignments for transcript reconstruction. We observed concordant results on real and simulated RNA-seq data, confirming the relevance of the metrics employed. Future developments in RNA-seq alignment methods would benefit from improved placement of multimapped reads, balanced utilization of existing gene annotation and a reduced false discovery rate for splice junctions.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据