Dissemin is shutting down on January 1st, 2025

Published in

Oxford University Press (OUP), Briefings in Bioinformatics, 5(15), p. 783-787

DOI: 10.1093/bib/bbt010

Links

Tools

Export citation

Search in Google Scholar

DRISEE overestimates errors in metagenomic sequencing data

This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

© The Author(s), 2013. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Briefings in Bioinformatics 15 (2014): 783-787, doi:10.1093/bib/bbt010. ; The extremely high error rates reported by Keegan et al. in ‘A platform-independent method for detecting errors in metagenomic sequencing data: DRISEE’ (PLoS Comput Biol 2012;8:e1002541) for many next-generation sequencing datasets prompted us to re-examine their results. Our analysis reveals that the presence of conserved artificial sequences, e.g. Illumina adapters, and other naturally occurring sequence motifs accounts for most of the reported errors. We conclude that DRISEE reports inflated levels of sequencing error, particularly for Illumina data. Tools offered for evaluating large datasets need scrupulous review before they are implemented. ; National Institutes of Health [1UH2DK083993 to M.L.S.]; National Science Foundation [BDI- 096026 to S.M.H.].