A Filtering Method to Generate High Quality Short Reads Using Illumina Paired-End Technology

Eren, A. Murat; Murat Eren, A.; Vineis, Joseph H.; Morrison, Hilary G.; Sogin, Mitchell L.

Published in

Public Library of Science, PLoS ONE, 6(8), p. e66643, 2013

DOI: 10.1371/journal.pone.0066643

Tools

Export citation

Search in Google Scholar

A Filtering Method to Generate High Quality Short Reads Using Illumina Paired-End Technology

Journal article published in 2013 by A. Murat Eren, A. Murat Eren, Joseph H. Vineis, Hilary G. Morrison

, Mitchell L. Sogin

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Consensus between independent reads improves the accuracy of genome and transcriptome analyses, however lack of consensus between very similar sequences in metagenomic studies can and often does represent natural variation of biological significance. The common use of machine-assigned quality scores on next generation platforms does not necessarily correlate with accuracy. Here, we describe using the overlap of paired-end, short sequence reads to identify error-prone reads in marker gene analyses and their contribution to spurious OTUs following clustering analysis using QIIME. Our approach can also reduce error in shotgun sequencing data generated from libraries with small, tightly constrained insert sizes. The open-source implementation of this algorithm in Python programming language with user instructions can be obtained from https://github.com/meren/illumina-utils.

Published in

Links

Tools

A Filtering Method to Generate High Quality Short Reads Using Illumina Paired-End Technology

Abstract