4.7 Article

Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences

Journal

FRONTIERS IN PLANT SCIENCE
Volume 12, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fpls.2021.609729

Keywords

plastome expansion; repeat sequence; hybrid assembly; AT-biased base composition; long-read sequencing; palindromic repeat; inversion

Funding

  1. National Natural Science Foundation of China [U1804117]
  2. Key Scientific Research Projects of Henan Province [17A180023]

This study presents the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum, revealing expanded genome size, proliferation of AT-rich repeat sequences, and low GC content and gene density. The increase in genome size is attributed to the proliferation of AT-biased non-coding regions, making the genus a typical example of plastome expansion driven by non-coding regions. Hybrid assembly based on long and short reads is recommended for sequencing plastomes with AT-biased base composition.
The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which may make assembling these genomes from short-read sequencing data more challenging. Here, we present the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum. We de novo assembled the chloroplast genomes of the two species with a combination of short-read Illumina data and long-read PacBio data. The plastomes of the two species are characterized by expanded genome size, proliferated AT-rich repeat sequences, low GC content and gene density, as well as low substitution rates of the coding genes. The plastomes of C. tibeticum (197,815 bp) and C. subtropicum (212,668 bp) are substantially larger than those of the three species sequenced in previous studies. The plastome of C. subtropicum is the longest reported in Orchidaceae to date. Despite the increase in genome size, the gene order and gene number of the plastomes are conserved, with the exception of an approximately 75 kb inversion in the large single copy (LSC) region shared by the two species. Most striking is the record-setting low GC content in C. subtropicum (28.2%). Moreover, the plastome expansion of the two species is strongly correlated with the proliferation of AT-biased non-coding regions: the non-coding content of C. subtropicum is in excess of 57%. The genus provides a typical example of plastome expansion induced by the expansion of non-coding regions. Considering the pros and cons of different sequencing technologies, we recommend hybrid assembly based on long and short reads for sequencing plastomes with AT-biased base composition.
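As a rough illustration of the two headline statistics in the abstract (GC content and plastome size), the following Python sketch shows how such values might be computed from a sequence string. It is not the authors' pipeline: the function name `gc_content`, the dictionary `PLASTOME_SIZES`, and the toy fragment are assumptions introduced here, and only the two genome sizes are taken from the abstract.

```python
def gc_content(seq: str) -> float:
    """Return the GC fraction of a nucleotide sequence.

    Only A, C, G and T are counted, so ambiguity codes such as N
    do not affect the result. (Hypothetical helper for illustration.)
    """
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return gc / (gc + at)


# Plastome sizes in bp, as reported in the abstract.
PLASTOME_SIZES = {"C. tibeticum": 197_815, "C. subtropicum": 212_668}


def size_difference(sizes: dict, a: str, b: str) -> int:
    """Return how many bp longer plastome `a` is than plastome `b`."""
    return sizes[a] - sizes[b]


if __name__ == "__main__":
    toy_fragment = "ATATTAAGCATTTATAAC"  # made-up AT-rich fragment, not real data
    print(f"GC content of toy fragment: {gc_content(toy_fragment):.1%}")
    extra = size_difference(PLASTOME_SIZES, "C. subtropicum", "C. tibeticum")
    print(f"C. subtropicum is {extra} bp longer than C. tibeticum")
```

In practice the sequence would be read from a FASTA file of the assembled plastome rather than typed in as a string; the counting logic stays the same.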
