Article

fjoin: Simple and efficient computation of feature overlaps

Journal

JOURNAL OF COMPUTATIONAL BIOLOGY
Volume 13, Issue 8, Pages 1457-1464

Publisher

MARY ANN LIEBERT, INC
DOI: 10.1089/cmb.2006.13.1457

Keywords

features; intervals; overlap computation; algorithm; implementation

Sets of biological features with genome coordinates (e.g., genes and promoters) are a particularly common form of data in bioinformatics today. Accordingly, an increasingly important processing step involves comparing coordinates from large sets of features to find overlapping feature pairs. This paper presents fjoin, an efficient, robust, and simple algorithm for finding these pairs, and a downloadable implementation. For typical bioinformatics feature sets, fjoin requires O(n log n) time (O(n) if the inputs are sorted) and uses O(1) space. The reference implementation is a stand-alone Python program; it implements the basic algorithm and a number of useful extensions, which are also discussed in this paper.
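
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one sort-and-sweep approach that matches the stated complexity profile; it should not be read as the authors' fjoin code. It assumes features are `(start, end, name)` tuples with closed integer coordinates on a single chromosome, and the function name `overlapping_pairs` is hypothetical. Both inputs are sorted by start, then scanned once while a small window of still-open features is kept for each set.

```python
from typing import List, Tuple

# Assumed representation: (start, end, name), closed interval, one chromosome.
Feature = Tuple[int, int, str]


def overlapping_pairs(xs: List[Feature], ys: List[Feature]) -> List[Tuple[Feature, Feature]]:
    """Return every (x, y) pair whose intervals share at least one position."""
    xs = sorted(xs)  # sort by start; skip if the inputs are already sorted
    ys = sorted(ys)
    window_x: List[Feature] = []  # x features that may still overlap a later y
    window_y: List[Feature] = []  # y features that may still overlap a later x
    pairs: List[Tuple[Feature, Feature]] = []
    i = j = 0
    while i < len(xs) or j < len(ys):
        # Always consume the feature with the smallest start next.
        take_x = j >= len(ys) or (i < len(xs) and xs[i][0] <= ys[j][0])
        if take_x:
            f = xs[i]
            i += 1
            # Features ending before f starts cannot overlap f or anything later.
            window_y = [g for g in window_y if g[1] >= f[0]]
            pairs.extend((f, g) for g in window_y)
            window_x.append(f)
        else:
            g = ys[j]
            j += 1
            window_x = [f for f in window_x if f[1] >= g[0]]
            pairs.extend((f, g) for f in window_x)
            window_y.append(g)
    return pairs


if __name__ == "__main__":
    genes = [(1, 10, "a"), (20, 30, "b")]
    promoters = [(5, 25, "c"), (40, 50, "d")]
    for x, y in overlapping_pairs(genes, promoters):
        print(x[2], y[2])
```

Because features arrive in start order, a feature already in a window overlaps the incoming one exactly when its end is at or past the incoming start, so each pair is reported once. This sketch holds both inputs in memory for simplicity; the small-space behavior described above would presumably come from streaming pre-sorted inputs and keeping only the windows, which stay short when features are short relative to the genome.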
