Published in

Cold Spring Harbor Laboratory Press, Genome Research, 12(21), p. 2224-2241, 2011

DOI: 10.1101/gr.126599.111

Links

Tools

Export citation

Search in Google Scholar

Assemblathon 1: A competitive assessment of de novo short read assembly methods

Journal article published in 2011 by Dent A. Earl, Keith Bradnam ORCID, Dent A. Earl, John St. John, Aaron Darling, Dawei Lin, Joseph Fass, Hung On Ken Yu, Joseph Faas, Vince Buffalo, Daniel R. Zerbino, Hung On Ken Yu, Daniel R. Zerbino, Mark Diekhans, Ngan Nguyen and other authors.
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Red circle
Postprint: archiving forbidden
Orange circle
Published version: archiving restricted
Data provided by SHERPA/RoMEO

Abstract

Low-cost short read sequencing technology has revolutionized genomics, though it is only just becoming practical for the high-quality de novo assembly of a novel large genome. We describe the Assemblathon 1 competition, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies. In a collaborative effort, teams were asked to assemble a simulated Illumina HiSeq data set of an unknown, simulated diploid genome. A total of 41 assemblies from 17 different groups were received. Novel haplotype aware assessments of coverage, contiguity, structure, base calling, and copy number were made. We establish that within this benchmark: (1) It is possible to assemble the genome to a high level of coverage and accuracy, and that (2) large differences exist between the assemblies, suggesting room for further improvements in current methods. The simulated benchmark, including the correct answer, the assemblies, and the code that was used to evaluate the assemblies is now public and freely available from http://www.assemblathon.org/.