期刊
JOURNAL OF EXPERIMENTAL BOTANY
卷 62, 期 9, 页码 3093-3102出版社
OXFORD UNIV PRESS
DOI: 10.1093/jxb/err029
关键词
Assembly; C-4; next-generation sequencing; transcriptome
资金
- German Research Council (DFG) [SFB 590, SFB TR1, WE2231/4-1]
Next-generation sequencing enables the study of species without a sequenced genome at the 'omics' level. Custom transcriptome databases are generated and global expression profiles can be compared. However, the assembly of transcriptome sequence reads into contigs remains a daunting task. In this study, five different assembly programs, both traditional overlap-based, 'read-centric' assemblers and de Bruijn graph data structure-based assemblers, were compared. To this end, artificial read libraries with and without simulated sequencing errors were constructed from Arabidopsis thaliana, based on quantitative profiles of mature leaf tissue. The open source TGICL pipeline and the commercial CLC bio genomics workbench produced the best assemblies in terms of contig length, hybrid assemblies, redundancy reduction, and error tolerance. The mature leaf transcriptomes of the C-3 species Cleome spinosa and the C-4 species Cleome gynandra were assembled and analysed. The pathways and cellular processes tagged in the transcriptome assemblies reflect processes of a mature leaf. The databases are useful for extracting transcripts related to C-4 processes as full-length or nearly full-length sequences.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据