Journal
BIOINFORMATICS
Volume 37, Issue 1, Pages 118-120Publisher
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btaa1062
Keywords
-
Categories
Funding
- University of Virginia School of Medicine
- University of Virginia 4-VA program
Ask authors/readers for more resources
IGD is a method and tool for searching genome interval datasets faster and more efficiently, using a novel linear binning method to scale analysis to billions of genomic regions. It is able to handle large-scale genomic data with significantly improved speed and memory efficiency.
Databases of large-scale genome projects now contain thousands of genomic interval datasets. These data are a critical resource for understanding the function of DNA. However, our ability to examine and integrate interval data of this scale is limited. Here, we introduce the integrated genome database (IGD), a method and tool for searching genome interval datasets more than three orders of magnitude faster than existing approaches, while using only one hundredth of the memory. IGD uses a novel linear binning method that allows us to scale analysis to billions of genomic regions.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available