2011 IEEE 11th International Conference on Bioinformatics and Bioengineering
DOI: 10.1109/bibe.2011.49
Full text: Download
A variety of systems exist in which annotations are available at various levels of granularity to a reference coordinate system, such as roads and landmarks on a map, features within a 2-dimensional or 3-dimensional image, or genetic entities (GEs) mapped to a reference genome. As the number of annotations grows, methods to efficiently locate overlapping entities within a specific interval of interest are needed. In this paper, the efficiency of using interval trees for storing, maintaining, and querying large numbers of intervals with special attention to genetic entities is demonstrated. The results suggest a significant speed -- up when compared to relational database approaches. As such, interval trees serve as a suitable alternative for storing and searching annotations to a reference coordinate system.