4.7 Article

Pan-genome analysis of three main Chinese chestnut varieties

Journal

FRONTIERS IN PLANT SCIENCE
Volume 13, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fpls.2022.916550

Keywords

Castanea mollissima; de novo assembly; pan-genome; waxy gene; Nanopore sequencing; genome database

Categories

Funding

  1. National Basic Research Program of China [2013FY111700-2]
  2. China National Key RD Program [2018YFD1000605]
  3. Special Fund for the Construction of Scientific and Technological Innovation Capability [KJCX20200114, PT2022-07]
  4. Presidential Foundation of Institute of Forestry and Pomology [LGY201901]
  5. Youth Scientist Fund of Institute of Forestry and Pomology, Beijing Academy of Agriculture and Forestry Sciences [LGYJJ202007]

Ask authors/readers for more resources

This study reported the genome sequences of three cultivated varieties of Chinese chestnut, obtained high-quality genome assemblies through a combination of various technologies, identified unique gene families and single-copy homologous genes among different varieties, facilitating the study of molecular mechanisms underlying important traits and shortening the breeding process.
Chinese chestnut (Castanea mollissima Blume) is one of the earliest domesticated plants of high nutritional and ecological value, yet mechanisms of C. mollissima underlying its growth and development are poorly understood. Although individual chestnut species differ greatly, the molecular basis of the formation of their characteristic traits remains unknown. Though the draft genomes of chestnut have been previously released, the pan-genome of different variety needs to be studied. We report the genome sequence of three cultivated varieties of chestnut herein, namely Hei-Shan-Zhai-7 (H7, drought-resistant variety), Yan-Hong (YH, easy-pruning variety), and Yan-Shan-Zao-Sheng (ZS, early-maturing variety), to expedite convenience and efficiency in its genetics-based breeding. We obtained three chromosome-level chestnut genome assemblies through a combination of Oxford Nanopore technology, Illumina HiSeq X, and Hi-C mapping. The final genome assemblies are 671.99 Mb (YH), 790.99 Mb (ZS), and 678.90 Mb (H7), across 12 chromosomes, with scaffold N50 sizes of 50.50 Mb (YH), 65.05 Mb (ZS), and 52.16 Mb (H7). Through the identification of homologous genes and the cluster analysis of gene families, we found that H7, YH and ZS had 159, 131, and 91 unique gene families, respectively, and there were 13,248 single-copy direct homologous genes in the three chestnut varieties. For the convenience of research, the chestnut genome database(1) was constructed. Based on the results of gene family identification, the presence/absence variations (PAVs) information of the three sample genes was calculated, and a total of 2,364, 2,232, and 1,475 unique genes were identified in H7, YH and ZS, respectively. Our results suggest that the GBSS II-b gene family underwent expansion in chestnut (relative to nearest source species). Overall, we developed high-quality and well-annotated genome sequences of three C. mollissima varieties, which will facilitate clarifying the molecular mechanisms underlying important traits, and shortening the breeding process.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available