Published in

2011 IEEE 11th International Conference on Bioinformatics and Bioengineering

DOI: 10.1109/bibe.2011.49

Links

Tools

Export citation

Search in Google Scholar

Interval Trees for Detection of Overlapping Genetic Entities

This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

A variety of systems exist in which annotations are available at various levels of granularity to a reference coordinate system, such as roads and landmarks on a map, features within a 2-dimensional or 3-dimensional image, or genetic entities (GEs) mapped to a reference genome. As the number of annotations grows, methods to efficiently locate overlapping entities within a specific interval of interest are needed. In this paper, the efficiency of using interval trees for storing, maintaining, and querying large numbers of intervals with special attention to genetic entities is demonstrated. The results suggest a significant speed -- up when compared to relational database approaches. As such, interval trees serve as a suitable alternative for storing and searching annotations to a reference coordinate system.