4.6 Article

PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle

期刊

FRONTIERS IN GENETICS
卷 12, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fgene.2021.664974

关键词

cattle; PacBio single-molecule long-read sequencing; full-length transcript; alternative splicing; alternative polyadenylation; long non-coding RNA

资金

  1. National Natural Science Foundation of China [31872975]
  2. Chinese Academy of Agricultural Sciences of Technology Innovation Project [CAAS-XTCX2016010, CAAS-ZDXT2018006, ASTIP-IAS03]
  3. Program of National Beef Cattle, Yak Industrial Technology System [CARS-37]
  4. China Agriculture Research System of MOF and MARA

向作者/读者索取更多资源

This study utilized PacBio single-molecule real-time data to perform whole-transcriptome profiling in cattle, identifying a large number of full-length non-chimeric sequences and conducting gene transcript structural analysis. It also identified a large number of new genes and transcription factors, enhancing our understanding of the complexity of full-length transcripts in cattle.
Cattle (Bos taurus) is one of the most widely distributed livestock species in the world, and provides us with high-quality milk and meat which have a huge impact on the quality of human life. Therefore, accurate and complete transcriptome and genome annotation are of great value to the research of cattle breeding. In this study, we used error-corrected PacBio single-molecule real-time (SMRT) data to perform whole-transcriptome profiling in cattle. Then, 22.5 Gb of subreads was generated, including 381,423 circular consensus sequences (CCSs), among which 276,295 full-length non-chimeric (FLNC) sequences were identified. After correction by Illumina short reads, we obtained 22,353 error-corrected isoforms. A total of 305 alternative splicing (AS) events and 3,795 alternative polyadenylation (APA) sites were detected by transcriptome structural analysis. Furthermore, we identified 457 novel genes, 120 putative transcription factors (TFs), and 569 novel long non-coding RNAs (lncRNAs). Taken together, this research improves our understanding and provides new insights into the complexity of full-length transcripts in cattle.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据