4.5 Article

A pipeline for effectively developing highly polymorphic simple sequence repeats markers based on multi-sample genomic data

Journal

ECOLOGY AND EVOLUTION
Volume 12, Issue 3, Pages -

Publisher

WILEY
DOI: 10.1002/ece3.8705

Keywords

microsatellite; molecular marker; next-generation sequencing; resequencing; short tandem repeats; threatened species

Funding

  1. Ministry of Science and Technology of the People's Republic of China [2016YFC0503200]
  2. National Natural Science Foundation of China [31872244]
  3. Ministry of Ecology and Environment, The People's Republic of China [2019HB2096001006]

Ask authors/readers for more resources

The study introduces a pipeline for the rapid development of polymorphic SSR markers from multi-sample genomic data. By genotyping multiple individuals using bioinformatic software based on resequencing data, highly polymorphic SSRs can be detected prior to experimental validation, significantly improving efficiency and reducing experimental effort. Overall, the pipeline demonstrates a successful application of NGS technology for mining and developing molecular markers for ecological and evolutionary studies.
Simple sequence repeats (SSRs) are widely used genetic markers in ecology, evolution, and conservation even in the genomics era, while a general limitation to their application is the difficulty of developing polymorphic SSR markers. Next-generation sequencing (NGS) offers the opportunity for the rapid development of SSRs; however, previous studies developing SSRs using genomic data from only one individual need redundant experiments to test the polymorphisms of SSRs. In this study, we designed a pipeline for the rapid development of polymorphic SSR markers from multi-sample genomic data. We used bioinformatic software to genotype multiple individuals using resequencing data, detected highly polymorphic SSRs prior to experimental validation, significantly improved the efficiency and reduced the experimental effort. The pipeline was successfully applied to a globally threatened species, the brown eared-pheasant (Crossoptilon montchuricum), which showed very low genomic diversity. The 20 newly developed SSR markers were highly polymorphic, the average number of alleles was much higher than the genomic average. We also evaluated the effect of the number of individuals and sequencing depth on the SSR mining results, and we found that 10 individuals and similar to 10X sequencing data were enough to obtain a sufficient number of polymorphic SSRs, even for species with low genetic diversity. Furthermore, the genome assembly of NGS data from the optimal number of individuals and sequencing depth can be used as an alternative reference genome if a high-quality genome is not available. Our pipeline provided a paradigm for the application of NGS technology to mining and developing molecular markers for ecological and evolutionary studies.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available