4.8 Article

Genotyping and De Novo Discovery of Allelic Variants at the Brassicaceae Self-Incompatibility Locus from Short-Read Sequencing Data

Journal

MOLECULAR BIOLOGY AND EVOLUTION
Volume 37, Issue 4, Pages 1193-1201

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/molbev/msz258

Keywords

highly polymorphic loci; self-incompatibility; bioinformatic pipeline; balancing selection; plant reproduction

Funding

  1. European Research Council [648321]
  2. Region Hauts-de-France
  3. Ministre de l'Enseignement Superieur et de la Recherche (CPER Climibio)
  4. European Fund for Regional Economic Development
  5. European Research Council (ERC) [648321] Funding Source: European Research Council (ERC)

Ask authors/readers for more resources

Plant self-incompatibility (SI) is a genetic system that prevents selfing and enforces outcrossing. Because of strong balancing selection, the genes encoding SI are predicted to maintain extraordinarily high levels of polymorphism, both in terms of the number of functionally distinct S-alleles that segregate in SI species and in terms of their nucleotide sequence divergence. However, because of these two combined features, documenting polymorphism of these genes also presents importantmethodological challenges that have so far largely prevented the comprehensive analysis of complete allelic series in natural populations, and also precluded the obtention of complete genic sequences for many S-alleles. Here, we develop a powerful methodological approach based on a computationally optimized comparison of short Illumina sequencing reads from genomic DNA to a database of known nucleotide sequences of the extracellular domain of SRK (eSRK). By examining mapping patterns along the reference sequences, we obtain highly reliable predictions of Sgenotypes from individuals collected from natural populations of Arabidopsis halleri. Furthermore, using a de novo assembly approach of the filtered short reads, we obtain full-length sequences of eSRK even when the initial sequence in the database was only partial, and we discover putative new SRK alleles that were not initially present in the database. When including those new alleles in the reference database, we were able to resolve the complete diploid SI genotypes of all individuals. Beyond the specific case of Brassicaceae S-alleles, our approach can be readily applied to other polymorphic loci, given reference allelic sequences are available.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available