Article

msRepDB: a comprehensive repetitive sequence database of over 80 000 species

Journal

NUCLEIC ACIDS RESEARCH
Volume 50, Issue D1, Pages D236-D245

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkab1089

Funding

  1. National Natural Science Foundation of China [62002388, 61732009, 61772557, U1909208]
  2. King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) [FCC/1/1976-18-01, FCC/1/1976-23-01, FCC/1/1976-25-01, FCC/1/1976-26-01, REI/1/0018-01-01, REI/1/4216-01-01, REI/1/4437-01-01, REI/1/4473-01-01, URF/1/4352-01-01, URF/1/4379-01-01, REI/1/4742-01-01, URF/1/4098-01-01]
  3. Hunan Provincial Natural Science Foundation of China [2021JJ40787]
  4. Hunan Provincial Science and Technology Program [2018wk4001]
  5. 111 Project [B18059]

Repeats are common in the genomes of bacteria, plants and animals, playing crucial roles in evolution, inheritance and genomic stability. Comprehensive identification and classification of repeats can contribute to disease diagnosis, plant improvement and drug development.
Repeats are prevalent in the genomes of all bacteria, plants and animals, and they cover nearly half of the human genome. They play indispensable roles in evolution, inheritance, variation and genomic instability, and serve as substrates for chromosomal rearrangements that include disease-causing deletions, inversions and translocations. Comprehensive identification, classification and annotation of repeats in genomes can support the understanding and diagnosis of complex diseases, the optimization of plant properties and the development of new drugs. RepBase and Dfam are the two most frequently used repeat databases, but they are not sufficiently complete. Because no comprehensive multi-species repeat database exists, current research in this field remains far from satisfactory. LongRepMarker is a framework recently developed by our group for comprehensive identification of genomic repeats. Here we present msRepDB, a multi-species repeat database based on LongRepMarker that covers >80 000 species and is currently the most comprehensive of its kind. Comprehensive evaluations show that msRepDB contains more species, and more complete repeats and families, than the RepBase and Dfam databases. msRepDB is available at https://msrepdb.cbrc.kaust.edu.sa/pages/msRepDB/index.html.
