4.7 Article

Full-length SMRT transcriptome sequencing and microsatellite characterization in Paulownia catalpifolia

期刊

SCIENTIFIC REPORTS
卷 11, 期 1, 页码 -

出版社

NATURE RESEARCH
DOI: 10.1038/s41598-021-87538-8

关键词

-

资金

  1. Fundamental Research Funds for the Central Non-profit Research Institution of Chinese Academy of Forestry [CAFYBB2017ZA001-6]

向作者/读者索取更多资源

In this study, the full-length transcriptome of Paulownia catalpifolia leaves under varying degrees of drought stress was obtained using single-molecule real-time sequencing technology, revealing numerous microsatellites. These findings provide a valuable reference for exploring the genetic resources and breeding of drought-resistant varieties in Paulownia catalpifolia.
Paulownia catalpifolia is an important, fast-growing timber species known for its high density, color and texture. However, few transcriptomic and genetic studies have been conducted in P. catalpifolia. In this study, single-molecule real-time sequencing technology was applied to obtain the full-length transcriptome of P. catalpifolia leaves treated with varying degrees of drought stress. The sequencing data were then used to search for microsatellites, or simple sequence repeats (SSRs). A total of 28.83 Gb data were generated, 25,969 high-quality (HQ) transcripts with an average length of 1624 bp were acquired after removing the redundant reads, and 25,602 HQ transcripts (98.59%) were annotated using public databases. Among the HQ transcripts, 16,722 intact coding sequences, 149 long non-coding RNAs and 179 alternative splicing events were predicted, respectively. A total of 7367 SSR loci were distributed throughout 6293 HQ transcripts, of which 763 complex SSRs and 6604 complete SSRs. The SSR appearance frequency was 28.37%, and the average distribution distance was 5.59 kb. Among the 6604 complete SSR loci, 1-3 nucleotide repeats were dominant, occupying 97.85% of the total SSR loci, of which mono-, di- and tri-nucleotide repeats were 44.68%, 33.86% and 19.31%, respectively. We detected 112 repeat motifs, of which A/T (42.64%), AG/CT (12.22%), GA/TC (9.63%), GAA/TTC (1.57%) and CCA/TGG (1.54%) were most common in mono-, di- and tri-nucleotide repeats, respectively. The length of the repeat SSR motifs was 10-88 bp, and 4997 (75.67%) were <= 20 bp. This study provides a novel full-length transcriptome reference for P. catalpifolia and will facilitate the identification of germplasm resources and breeding of new drought-resistant P. catalpifolia varieties.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据