4.7 Article

High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data

Journal

SCIENTIFIC REPORTS
Volume 10, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41598-020-64680-3

Keywords

-

Funding

  1. Efficient Dairy Genome Project
  2. Genome Canada (Ottawa, Canada)
  3. Genome Alberta (Calgary, Canada)
  4. Ontario Genomics (Toronto, Canada)
  5. Alberta Ministry of Agriculture (Edmonton, Canada)
  6. Ontario Ministry of Research and Innovation (Toronto, Canada)
  7. Ontario Ministry of Agriculture, Food and Rural Affairs (Guelph, Canada)
  8. Canadian Dairy Network (Guelph, Canada)
  9. GrowSafe Systems (Airdrie, Canada)
  10. Alberta Milk (Edmonton, Canada)
  11. Victoria Agriculture (Melbourne, Australia)
  12. Scotland's Rural College (Edinburgh, UK)
  13. USDA Agricultural Research Service (Beltsville, United States)
  14. Qualitas AG (Zug, Switzerland)
  15. Aarhus University (Aarhus, Denmark)
  16. NSERC
  17. Qualitas AG team
  18. Swiss Association for Animal Science

Ask authors/readers for more resources

Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still needed in regard to the trade-off between high-true and low-false positive variant identification rates. Instead of improving single CNV detection methods, variants can be identified in silico with high confidence when multiple methods and datasets are combined. Here, CNV were identified from whole-genome sequences (WGS) and genotype array (GEN) data on 96 Holstein animals. After CNV detection, two sets of high confidence CNV regions (CNVR) were created that contained variants found in both WGS and GEN data following an animal-based (n=52) and a population-based (n=36) pipeline. Furthermore, the change in false positive CNV identification rates using different GEN marker densities was evaluated. The population-based approach characterized CNVR, which were more often shared among animals (average 40% more samples per CNVR) and were more often linked to putative functions (48 vs 56% of CNVR) than CNV identified with the animal-based approach. Moreover, false positive identification rates up to 22% were estimated on GEN information. Further research using larger datasets should use a population-wide approach to identify high confidence CNVR.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available