4.7 Article

Ultra-low-coverage genome-wide association study-insights into gestational age using 17,844 embryo samples with preimplantation genetic testing

Journal

GENOME MEDICINE
Volume 15, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s13073-023-01158-7

Keywords

Ultra-low-coverage whole genome sequencing; Imputation; Single-nucleotide polymorphisms; Genome-wide association study; Gestational age; Preterm birth

Ask authors/readers for more resources

This study developed a pipeline for analyzing ultra-low-coverage whole genome sequencing (ulcWGS) data for genome-wide association study (GWAS). The accuracy of genotype imputation under ultra-low coverage was improved by increasing sample size and applying filters. The study identified 11 genomic risk loci associated with gestational ages and 166 genes mapped to these loci. The findings demonstrate the high accuracy of ulcWGS for genotype imputation and provide insights into genetic variations related to gestational age in the Chinese population.
BackgroundVery low-coverage (0.1 to 1x) whole genome sequencing (WGS) has become a promising and affordable approach to discover genomic variants of human populations for genome-wide association study (GWAS). To support genetic screening using preimplantation genetic testing (PGT) in a large population, the sequencing coverage goes below 0.1x to an ultra-low level. However, the feasibility and effectiveness of ultra-low-coverage WGS (ulcWGS) for GWAS remains undetermined.MethodsWe built a pipeline to carry out analysis of ulcWGS data for GWAS. To examine its effectiveness, we benchmarked the accuracy of genotype imputation at the combination of different coverages below 0.1x and sample sizes from 2000 to 16,000, using 17,844 embryo PGT samples with approximately 0.04x average coverage and the standard Chinese sample HG005 with known genotypes. We then applied the imputed genotypes of 1744 transferred embryos who have gestational ages and complete follow-up records to GWAS.ResultsThe accuracy of genotype imputation under ultra-low coverage can be improved by increasing the sample size and applying a set of filters. From 1744 born embryos, we identified 11 genomic risk loci associated with gestational ages and 166 genes mapped to these loci according to positional, expression quantitative trait locus, and chromatin interaction strategies. Among these mapped genes, CRHBP, ICAM1, and OXTR were more frequently reported as preterm birth related. By joint analysis of gene expression data from previous studies, we constructed interrelationships of mainly CRHBP, ICAM1, PLAGL1, DNMT1, CNTLN, DKK1, and EGR2 with preterm birth, infant disease, and breast cancer.ConclusionsThis study not only demonstrates that ulcWGS could achieve relatively high accuracy of adequate genotype imputation and is capable of GWAS, but also provides insights into the associations between gestational age and genetic variations of the fetal embryos from Chinese population.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available