4.7 Article

ODGI: understanding pangenome graphs

期刊

BIOINFORMATICS
卷 38, 期 13, 页码 3319-3326

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac308

关键词

-

资金

  1. National Institutes of Health/NIDA [U01DA047638]
  2. National Institutes of Health/NIGMS [R01GM123489]
  3. NSF PPoSS Award [2118709]
  4. Central Innovation Programme (ZIM) for SMEs of the Federal Ministry for Economic Affairs and Energy of Germany
  5. Germany's Excellence Strategy (CMFI) [EXC2124, (iFIT)-EXC 2180-390900677]
  6. BMBF [031A537B, 031A533A, 031A538A, 031A533B, 031A535A, 031A537C, 031A534A, 031A532B]

向作者/读者索取更多资源

Pangenome graphs provide a complete representation of genomic diversity, but analyzing large-scale genome data using existing tools is challenging. Optimized Dynamic Genome/Graph Implementation (ODGI) is a new tool suite with efficient in-memory representation and support for various operations and visualization. Its parallel execution helps answer complex biological questions quickly.
Motivation: Pangenome graphs provide a complete representation of the mutual alignment of collections of genomes. These models offer the opportunity to study the entire genomic diversity of a population, including structurally complex regions. Nevertheless, analyzing hundreds of gigabase-scale genomes using pangenome graphs is difficult as it is not well-supported by existing tools. Hence, fast and versatile software is required to ask advanced questions to such data in an efficient way. Results: We wrote Optimized Dynamic Genome/Graph Implementation (ODGI), a novel suite of tools that implements scalable algorithms and has an efficient in-memory representation of DNA pangenome graphs in the form of variation graphs. ODGI supports pre-built graphs in the Graphical Fragment Assembly format. ODGI includes tools for detecting complex regions, extracting pangenomic loci, removing artifacts, exploratory analysis, manipulation, validation and visualization. Its fast parallel execution facilitates routine pangenomic tasks, as well as pipelines that can quickly answer complex biological questions of gigabase-scale pangenome graphs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据