Dissemin is shutting down on January 1st, 2025

Published in

Oxford University Press, Bioinformatics, 5(31), p. 665-673, 2014

DOI: 10.1093/bioinformatics/btu696

Links

Tools

Export citation

Search in Google Scholar

FlaiMapper: computational annotation of small ncRNA-derived fragments using RNA-seq high-throughput data

Journal article published in 2014 by Youri Hoogstrate ORCID, Guido Jenster ORCID, Elena S. Martens-Uzunova
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Motivation: Recent discoveries show that most types of small non-coding RNAs (sncRNAs) such as miRNAs, snoRNAs and tRNAs get further processed into putatively active smaller RNA species. Their roles, genetic profiles and underlying processing mechanisms are only partially understood. To find their quantities and characteristics, a proper annotation is essential. Here, we present FlaiMapper, a method that extracts and annotates the locations of sncRNA-derived RNAs (sncdRNAs). These sncdRNAs are often detected in sequencing data and observed as fragments of their precursor sncRNA. Using small RNA-seq read alignments, FlaiMapper is able to annotate fragments primarily by peak detection on the start and end position densities followed by filtering and a reconstruction process. Results: To assess performance of FlaiMapper, we used independent publicly available small RNA-seq data. We were able to detect fragments representing putative sncdRNAs from nearly all types of sncRNA, including 97.8% of the annotated miRNAs in miRBase that have supporting reads. Comparison of FlaiMapper-predicted boundaries of miRNAs with miRBase entries demonstrated that 89% of the start and 54% of the end positions are identical. Additional benchmarking showed that FlaiMapper is superior in performance compared with existing software. Further analysis indicated a variety of characteristics in the fragments, including sequence motifs and relations with RNA interacting factors. These characteristics set a good basis for further research on sncdRNAs. Availability and implementation: The platform independent GPL licensed Python 2.7 code is available at: https://github.com/yhoogstrate/flaimapper. Corresponding Linux-specific scripts and annotations can be found in the same repository. Contact: e.martens@erasmusmc.nl Supplementary information: Supplementary Data are available at Bioinformatics online.