IGD: high-performance search for large-scale genomic interval datasets

2Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Databases of large-scale genome projects now contain thousands of genomic interval datasets. These data are a critical resource for understanding the function of DNA. However, our ability to examine and integrate interval data of this scale is limited. Here, we introduce the integrated genome database (IGD), a method and tool for searching genome interval datasets more than three orders of magnitude faster than existing approaches, while using only one hundredth of the memory. IGD uses a novel linear binning method that allows us to scale analysis to billions of genomic regions. Availabilityand implementation: https://github.com/databio/IGD.

Cite

CITATION STYLE

APA

Feng, J., & Sheffield, N. C. (2021). IGD: high-performance search for large-scale genomic interval datasets. Bioinformatics, 37(1), 118–120. https://doi.org/10.1093/bioinformatics/btaa1062

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free