Published in

Oxford University Press (OUP), Bioinformatics, 17(9), p. 840-842

DOI: 10.1093/bioinformatics/17.9.840

Detecting the impact of sequencing errors on SAGE data

Journal article published in 2001 by Jacques Colinge, Georg Feger
This paper is made freely available by the publisher.

Preprint: archiving allowed
Postprint: archiving allowed
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

SAGE data are obtained by sequencing short DNA tags. Because DNA sequencing is error-prone, SAGE data contain errors. We propose a new approach to identify tags whose abundance is biased by sequencing errors. This approach is based on a concept of neighbourhood: an abundant tag can contaminate the tags whose sequences are very close to its own. Applying our approach reveals that moderately abundant tags can be generated solely by sequencing errors. It also allows correct rare tags to be detected.

Availability: Software is available upon request, only to non-profit entities and for non-commercial purposes.
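The neighbourhood idea in the abstract can be illustrated with a minimal sketch. This is not the authors' software or their statistical model, which the abstract does not describe. It assumes, purely for illustration, that a "very close" sequence means exactly one substitution, that every base is misread independently with a fixed probability `per_base_error`, and that a misread is equally likely to produce any of the three other bases. The function names and the `ratio` threshold are hypothetical.

```python
BASES = "ACGT"


def neighbours(tag):
    """Yield every tag that differs from `tag` by exactly one substitution."""
    for i, original in enumerate(tag):
        for base in BASES:
            if base != original:
                yield tag[:i] + base + tag[i + 1:]


def expected_contamination(counts, per_base_error=0.01):
    """Estimate how many counts of each tag could be misreads of its neighbours.

    Assumption (not from the paper): a given single-base substitution occurs
    with probability per_base_error / 3. The chance that the remaining bases
    are read correctly is ignored, so the estimate is slightly too high.
    """
    rate = per_base_error / 3
    return {
        tag: sum(counts.get(n, 0) * rate for n in neighbours(tag))
        for tag in counts
    }


def flag_suspect_tags(counts, per_base_error=0.01, ratio=0.5):
    """Return tags whose expected contamination is at least `ratio` of their count."""
    expected = expected_contamination(counts, per_base_error)
    return {tag for tag, c in counts.items() if expected[tag] >= ratio * c}


if __name__ == "__main__":
    # Toy counts, invented for this example.
    counts = {"AAAAAAAAAA": 3000, "AAAAAAAAAC": 10, "GGGGGGGGGG": 10}
    print(flag_suspect_tags(counts))
```

In this toy input, `AAAAAAAAAC` sits one substitution away from a tag seen 3000 times, so its 10 counts could plausibly be explained by misreads alone, whereas `GGGGGGGGGG` has the same count but no abundant neighbour and is therefore more likely a genuine rare tag. A real analysis would need a measured error rate and a proper significance test rather than a fixed ratio.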