4.5 Article

Extracting 'legacy loci' from an invertebrate sequence capture data set

Journal

ZOOLOGICA SCRIPTA
Volume 51, Issue 1, Pages 14-31

Publisher

WILEY
DOI: 10.1111/zsc.12513

Keywords

legacy loci; next-generation sequencing; Sanger sequencing; sequence capture

Funding

  1. Division of Integrative Organismal Systems [1553100]
  2. Division Of Integrative Organismal Systems
  3. Direct For Biological Sciences [1553100] Funding Source: National Science Foundation

Ask authors/readers for more resources

Sequence capture studies are increasingly being used to recover legacy loci from off-target sequences, with some success in recovering mitochondrial and nuclear legacy loci. Complementing complete mitogenomes with additional reference mitochondrial sequences did not improve phylogenetic analysis significantly, while extracted legacy loci offered substantial improvement. Phylogenetic analysis combining legacy and sequence capture data generally produced congruent results with previous studies, although with lower support. Future studies may benefit from integrating legacy data with sequence capture data without additional expenses.
Sequence capture studies result in rich data sets comprising hundreds to thousands of targeted genomic regions that are superseding Sanger-based data sets comprised of a few well-known loci with historical uses in phylogenetics ('legacy loci'). However, integrating sequence capture and Sanger-based data sets is of interest as legacy loci can include different types of loci (e.g. mitochondrial and nuclear) across a potentially larger sample of species from past studies. Sequence capture data sets include nontargeted sequences, and there has been recent interest in extracting legacy loci from invertebrate data sets. Here, we use published legacy data from leaf-footed bugs (Hemiptera: Coreoidea) to recover 15 mitochondrial and seven nuclear legacy loci from off-target sequences in a sequence capture data set, explore approaches to improve legacy locus recovery, and combine these loci with sequence capture data for phylogenetic analysis. Two nuclear loci were determined to already be targeted by sequence capture baits. Most of the remaining loci were successfully recovered from off-target sequences, but this recovery varied greatly. Additionally, complementing complete mitogenomes with additional reference mitochondrial sequences from a genetic depository did not offer improvement for most of our taxa; however, supplementing these reference sequences with extracted legacy loci offered >= 6% improvement across taxa for a given mitochondrial locus (negligible improvement for nuclear loci). Phylogenetic analysis of legacy and sequence capture data produced a topology generally congruent with recent studies, but support was lower. Thus, future studies may employ the approaches used in this study to integrate legacy data with newly generated sequence capture data sets without added expenses.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available