Published in

Oxford University Press, Bioinformatics, 16(33), p. 2594-2595, 2017

DOI: 10.1093/bioinformatics/btx206

Links

Tools

Export citation

Search in Google Scholar

RTK: efficient rarefaction analysis of large datasets

Journal article published in 2017 by Paul Saary, Kristoffer Forslund, Peer Bork, Falk Hildebrand ORCID
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Abstract Motivation The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate for large datasets and some computationally intensive tasks, like rarefaction and associated analysis. Results Here we present a software package for rarefaction analysis of large count matrices, as well as estimation and visualization of diversity, richness and evenness. Our software is designed for ease of use, operating at least 7x faster than existing solutions, despite requiring 10x less memory. Availability and Implementation C ++ and R source code (GPL v.2) as well as binaries are available from https://github.com/hildebra/Rarefaction and from CRAN (https://cran.r-project.org/). Supplementary information Supplementary data are available at Bioinformatics online.