4.7 Article

IGD: high-performance search for large-scale genomic interval datasets

Journal

BIOINFORMATICS
Volume 37, Issue 1, Pages 118-120

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btaa1062

Keywords

-

Funding

  1. University of Virginia School of Medicine
  2. University of Virginia 4-VA program

Ask authors/readers for more resources

IGD is a method and tool for searching genome interval datasets faster and more efficiently, using a novel linear binning method to scale analysis to billions of genomic regions. It is able to handle large-scale genomic data with significantly improved speed and memory efficiency.
Databases of large-scale genome projects now contain thousands of genomic interval datasets. These data are a critical resource for understanding the function of DNA. However, our ability to examine and integrate interval data of this scale is limited. Here, we introduce the integrated genome database (IGD), a method and tool for searching genome interval datasets more than three orders of magnitude faster than existing approaches, while using only one hundredth of the memory. IGD uses a novel linear binning method that allows us to scale analysis to billions of genomic regions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available