Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencing

Yassour, Moran; Kaplan, Tommy; Fraser, Hunter B.; Levin, Joshua Z.; Pfiffner, Jenna; Adiconis, Xian; Schroth, Gary; Luo, Shujun; Khrebtukova, Irina; Gnirke, Andreas; Nusbaum, Chad; Thompson, Dawn-Anne; Friedman, Nir; Regev, Aviv

Published in

National Academy of Sciences, Proceedings of the National Academy of Sciences, 9(106), p. 3264-3269, 2009

DOI: 10.1073/pnas.0812841106

Tools

Export citation

Search in Google Scholar

Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencing

Journal article published in 2009 by Moran Yassour

, Tommy Kaplan, Hunter B. Fraser, Joshua Z. Levin, Jenna Pfiffner, Xian Adiconis, Gary Schroth, Shujun Luo, Irina Khrebtukova, Andreas Gnirke, Chad Nusbaum, Dawn-Anne Thompson, Nir Friedman, Aviv Regev

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving forbidden

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Defining the transcriptome, the repertoire of transcribed regions encoded in the genome, is a challenging experimental task. Current approaches, relying on sequencing of ESTs or cDNA libraries, are expensive and labor-intensive. Here, we present a general approach for ab initio discovery of the complete transcriptome of the budding yeast, based only on the unannotated genome sequence and millions of short reads from a single massively parallel sequencing run. Using novel algorithms, we automatically construct a highly accurate transcript catalog. Our approach automatically and fully defines 86% of the genes expressed under the given conditions, and discovers 160 previously undescribed transcription units of 250 bp or longer. It correctly demarcates the 5′ and 3′ UTR boundaries of 86 and 77% of expressed genes, respectively. The method further identifies 83% of known splice junctions in expressed genes, and discovers 25 previously uncharacterized introns, including 2 cases of condition-dependent intron retention. Our framework is applicable to poorly understood organisms, and can lead to greater understanding of the transcribed elements in an explored genome.

Published in

Links

Tools

Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencing

Abstract