4.6 Article

Aiming off the target: recycling target capture sequencing reads for investigating repetitive DNA

期刊

ANNALS OF BOTANY
卷 128, 期 7, 页码 835-848

出版社

OXFORD UNIV PRESS
DOI: 10.1093/aob/mcab063

关键词

Genome skimming; holocentric; reduced-representation sequencing; RepeatExplorer; Rhynchospora; satellite DNA; transposable elements

资金

  1. Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior-Brasil (CAPES) [001]
  2. CAPES-PRINT (Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior -Programa Institucional de Internacionalizacao) [88887.363884/2019-00]
  3. CNPq (Conselho Nacional de Desenvolvimento Cientifico e Tecnologico) [141037/2018-0]

向作者/读者索取更多资源

The study demonstrated the potential of utilizing off-target TCS reads to identify and characterize repetitive sequences in plant genomes, which has important implications for cytogenetic and phylogenomic research. The reliability of repeat information obtained from TCS data was high when compared with GS data, and the repeat-based phylogenies constructed from TCS data showed congruence with genetic alignment-based trees.
Background and Aims With the advance of high-throughput sequencing, reduced-representation methods such as target capture sequencing (TCS) emerged as cost-efficient ways of gathering genomic information, particularly from coding regions. As the off-target reads from such sequencing are expected to be similar to genome skimming (GS), we assessed the quality of repeat characterization in plant genomes using these data. Methods Repeat composition obtained from TCS datasets of five Rhynchospora (Cyperaceae) species were compared with GS data from the same taxa. In addition, a FISH probe was designed based on the most abundant satellite found in the TCS dataset of Rhynchospora cephalotes. Finally, repeat-based phylogenies of the five Rhynchospora species were constructed based on the GS and TCS datasets and the topologies were compared with a gene-alignment-based phylogenetic tree. Key Results All the major repetitive DNA families were identified in TCS, including repeats that showed abundances as low as 0.01 % in the GS data. Rank correlations between GS and TCS repeat abundances were moderately high (r = 0.58-0.85), increasing after filtering out the targeted loci from the raw TCS reads (r = 0.66-0.92). Repeat data obtained by TCS were also reliable in developing a cytogenetic probe of a new variant of the holocentromeric satellite Tyba. Repeat-based phylogenies from TCS data were congruent with those obtained from GS data and the gene-alignment tree. Conclusions Our results show that off-target TCS reads can be recycled to identify repeats for cyto- and phylogenomic investigations. Given the growing availability of TCS reads, driven by global phylogenomic projects, our strategy represents a way to recycle genomic data and contribute to a better characterization of plant biodiversity.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据