☆ 4.8 Article

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation

NUCLEIC ACIDS RESEARCH (2018)

期刊

NUCLEIC ACIDS RESEARCH

卷 46, 期 D1, 页码 D221-D228

出版社

OXFORD UNIV PRESS

DOI: 10.1093/nar/gkx1031

关键词

类别

Biochemistry & Molecular Biology

资金

National Institutes of Health, National Library of Medicine
Wellcome Trust [WT098051, WT108749/Z/15/Z, 099129/Z/12/Z]
National Human Genome Research Institute (NHGRI) [U41HG007234, 2U41HG007234]
European Molecular Biology Laboratory
NHGRI grant [U41HG007234, U41HG003345, U41HG000330]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community.

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation

期刊

NUCLEIC ACIDS RESEARCH

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation

期刊

NUCLEIC ACIDS RESEARCH

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文