4.3 Article

Allele Identification for Transcriptome-Based Population Genomics in the Invasive Plant Centaurea solstitialis

Journal

G3-GENES GENOMES GENETICS
Volume 3, Issue 2, Pages 359-367

Publisher

GENETICS SOCIETY AMERICA
DOI: 10.1534/g3.112.003871

Keywords

normalized ESTs; allele clustering; 454 GS FLX Titanium; yellow starthistle; invasive species

Funding

  1. Roche Applied Science/454 Life Sciences
  2. Natural Sciences and Engineering Research Council of Canada (NSERC) [353026]
  3. Division Of Integrative Organismal Systems
  4. Direct For Biological Sciences [820451] Funding Source: National Science Foundation

Ask authors/readers for more resources

Transcriptome sequences are becoming more broadly available for multiple individuals of the same species, providing opportunities to derive population genomic information from these datasets. Using the 454 Life Science Genome Sequencer FLX and FLX-Titanium next-generation platforms, we generated 112430 Mbp of sequence for normalized cDNA for 40 wild genotypes of the invasive plant Centaurea solstitialis, yellow starthistle, from across its worldwide distribution. We examined the impact of sequencing effort on transcriptome recovery and overlap among individuals. To do this, we developed two novel publicly available software pipelines: SnoWhite for read cleaning before assembly, and AllelePipe for clustering of loci and allele identification in assembled datasets with or without a reference genome. AllelePipe is designed specifically for cases in which read depth information is not appropriate or available to assist with disentangling closely related paralogs from allelic variation, as in transcriptome or previously assembled libraries. We find that modest applications of sequencing effort recover most of the novel sequences present in the transcriptome of this species, including single-copy loci and a representative distribution of functional groups. In contrast, the coverage of variable sites, observation of heterozygosity, and overlap among different libraries are all highly dependent on sequencing effort. Nevertheless, the information gained from overlapping regions was informative regarding coarse population structure and variation across our small number of population samples, providing the first genetic evidence in support of hypothesized invasion scenarios.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available