Full-length transcriptome assembly from RNA-Seq data without a reference genome

Grabherr, Manfred G.; Haas, Brian J.; Yassour, Moran; Levin, Joshua Z.; Thompson, Dawn A.; Amit, Ido; Adiconis, Xian; Fan, Lin; Raychowdhury, Raktima; Zeng, Qiandong; Chen, Zehua; Mauceli, Evan; Hacohen, Nir; Gnirke, Andreas; Rhind, Nicholas; di Palma, Federica; Birren, Bruce W.; Nusbaum, Chad; Lindblad-Toh, Kerstin; Friedman, Nir; Regev, Aviv

Published in

Nature Research, Nature Biotechnology, 7(29), p. 644-652, 2011

DOI: 10.1038/nbt.1883

Tools

Export citation

Search in Google Scholar

Full-length transcriptome assembly from RNA-Seq data without a reference genome

Journal article published in 2011 by Manfred G. Grabherr, Brian J. Haas, Moran Yassour

, Joshua Z. Levin, Dawn A. Thompson, Ido Amit

, Xian Adiconis, Lin Fan, Raktima Raychowdhury, Qiandong Zeng, Zehua Chen, Evan Mauceli, Nir Hacohen, Andreas Gnirke, Nicholas Rhind and other authors.

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

Published in

Links

Tools

Full-length transcriptome assembly from RNA-Seq data without a reference genome

Abstract