4.7 Article Data Paper

De novo assembly of the cattle reference genome with single-molecule sequencing

Journal

GIGASCIENCE
Volume 9, Issue 3, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/gigascience/giaa021

Keywords

bovine genome; reference assembly; cattle; Hereford

Funding

  1. USDA/NRSP-8 Animal Genome
  2. USDA-ARS Meat Animal Research Center
  3. Neogen
  4. Zoetis
  5. USDA [CRIS 8042-31000-001-00-D, CRIS 8042-31000-002-00-D, CRIS 5090-31000-026-00-D, CRIS 3040-31000-100-00-D]
  6. USDA NIFA [5090-31000-026-06-I, 2016-68004-24827, 2013-67015-21202, 2015-67015-23183]
  7. NIH [1R01HD084353-01A1]
  8. USDA Hatch [MO-HAAS0001]
  9. Intramural Research Program of the National Library of Medicine, National Institutes of Health
  10. UKRI-BBSRC [BB/M027155/1, BBS/E/I/00007035, BBS/E/I/00007038, BBS/E/I/00007039]
  11. Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health
  12. Korean Visiting Scientist Training Award (KVSTA) through the Korea Health Industry Development Institute (KHIDI) - Ministry of Health Welfare [HI17C2098]
  13. BBSRC [BBS/E/I/00007035, BB/M027155/1, BBS/E/I/00007039, BBS/E/I/00007038] Funding Source: UKRI

Ask authors/readers for more resources

Background: Major advances in selection progress for cattle have been made following the introduction of genomic tools over the past 10-12 years. These tools depend upon the Bos taurus reference genome (UMD3.1.1), which was created using now-outdated technologies and is hindered by a variety of deficiencies and inaccuracies. Results: We present the new reference genome for cattle, ARS-UCD1.2, based on the same animal as the original to facilitate transfer and interpretation of results obtained from the earlier version, but applying a combination of modern technologies in a de novo assembly to increase continuity, accuracy, and completeness. The assembly includes 2.7 Gb and is >250x more continuous than the original assembly, with contig N50 >25 Mb and L50 of 32. We also greatly expanded supporting RNA-based data for annotation that identifies 30,396 total genes (21,039 protein coding). The new reference assembly is accessible in annotated form for public use. Conclusions: We demonstrate that improved continuity of assembled sequence warrants the adoption of ARS-UCD1.2 as the new cattle reference genome and that increased assembly accuracy will benefit future research on this species.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available