Article

The Dfam database of repetitive DNA families

Journal

NUCLEIC ACIDS RESEARCH
Volume 44, Issue D1, Pages D81-D89

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkv1272

Funding

  1. Howard Hughes Medical Institute Janelia Research Campus
  2. National Institutes of Health [P41LM006252-1, R01 HG002939]
  3. University of Montana University Grant Program

Repetitive DNA, especially that due to transposable elements (TEs), makes up a large fraction of many genomes. Dfam is an open access database of families of repetitive DNA elements, in which each family is represented by a multiple sequence alignment and a profile hidden Markov model (HMM). The initial release of Dfam, featured in the 2013 NAR Database Issue, contained 1143 families of repetitive elements found in humans, and was used to produce more than 100 Mb of additional annotation of TE-derived regions in the human genome, with improved speed. Here, we describe recent advances, most notably expansion to 4150 total families including a comprehensive set of known repeat families from four new organisms (mouse, zebrafish, fly and nematode). We describe improvements to coverage, and to our methods for identifying and reducing false annotation. We also describe updates to the website interface. The Dfam website has moved to http://dfam.org. Seed alignments, profile HMMs, hit lists and other underlying data are available for download.
