4.7 Article

High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data

期刊

SCIENTIFIC REPORTS
卷 10, 期 1, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41598-020-64680-3

关键词

-

资金

  1. Efficient Dairy Genome Project
  2. Genome Canada (Ottawa, Canada)
  3. Genome Alberta (Calgary, Canada)
  4. Ontario Genomics (Toronto, Canada)
  5. Alberta Ministry of Agriculture (Edmonton, Canada)
  6. Ontario Ministry of Research and Innovation (Toronto, Canada)
  7. Ontario Ministry of Agriculture, Food and Rural Affairs (Guelph, Canada)
  8. Canadian Dairy Network (Guelph, Canada)
  9. GrowSafe Systems (Airdrie, Canada)
  10. Alberta Milk (Edmonton, Canada)
  11. Victoria Agriculture (Melbourne, Australia)
  12. Scotland's Rural College (Edinburgh, UK)
  13. USDA Agricultural Research Service (Beltsville, United States)
  14. Qualitas AG (Zug, Switzerland)
  15. Aarhus University (Aarhus, Denmark)
  16. NSERC
  17. Qualitas AG team
  18. Swiss Association for Animal Science

向作者/读者索取更多资源

Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still needed in regard to the trade-off between high-true and low-false positive variant identification rates. Instead of improving single CNV detection methods, variants can be identified in silico with high confidence when multiple methods and datasets are combined. Here, CNV were identified from whole-genome sequences (WGS) and genotype array (GEN) data on 96 Holstein animals. After CNV detection, two sets of high confidence CNV regions (CNVR) were created that contained variants found in both WGS and GEN data following an animal-based (n=52) and a population-based (n=36) pipeline. Furthermore, the change in false positive CNV identification rates using different GEN marker densities was evaluated. The population-based approach characterized CNVR, which were more often shared among animals (average 40% more samples per CNVR) and were more often linked to putative functions (48 vs 56% of CNVR) than CNV identified with the animal-based approach. Moreover, false positive identification rates up to 22% were estimated on GEN information. Further research using larger datasets should use a population-wide approach to identify high confidence CNVR.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据