Dissemin is shutting down on January 1st, 2025

Published in

Gigabyte Journal, Gigabyte, (2021), p. 1-14, 2021

DOI: 10.46471/gigabyte.29

Links

Tools

Export citation

Search in Google Scholar

Genome assembly and annotation of the tambaqui (Colossoma macropomum): an emblematic fish of the Amazon River basin

This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Green circle
Published version: archiving allowed
Data provided by SHERPA/RoMEO

Abstract

Colossoma macropomum, known as “tambaqui”, is the largest Characiformes fish in the Amazon River Basin and a leading species in Brazilian aquaculture and fisheries. Good quality meat and excellent adaptability to culture systems are some of its remarkable farming features. To support studies into the genetics and genomics of the tambaqui, we have produced the first high-quality genome for the species. We combined Illumina and PacBio sequencing technologies to generate a reference genome, assembled with 39× coverage of long reads and polished to a consensus quality value (QV) of 36 with 130× coverage of short reads. The genome was assembled into 1269 scaffolds (a total of 1,221,847,006 bases), with a scaffold N50 size of 40 Mb, where 93% of all assembled bases were placed in the largest 54 scaffolds corresponding to the diploid karyotype of the tambaqui. Furthermore, the NCBI Annotation Pipeline annotated genes, pseudogenes, and non-coding transcripts using the RefSeq database as evidence, guaranteeing a high-quality annotation. A Genome Data Viewer for the tambaqui was produced, which will benefit groups interested in exploring the unique genomic features of the species. The availability of a highly accurate genome assembly for tambaqui provides the foundation for the discovery of novel ecological and evolutionary insights, and is a helpful resource for aquaculture.