Dissemin is shutting down on January 1st, 2025

Published in

Nature Research, Scientific Data, 1(11), 2024

DOI: 10.1038/s41597-024-03215-1

Links

Tools

Export citation

Search in Google Scholar

Three de novo assembled wild cacao genomes from the Upper Amazon

This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Red circle
Postprint: archiving forbidden
Green circle
Published version: archiving allowed
Data provided by SHERPA/RoMEO

Abstract

AbstractTheobroma cacao, the chocolate tree, is indigenous to the Amazon basin, the greatest biodiversity hotspot on earth. Recent advancement in plant genomics highlights the importance of de novo sequencing of multiple reference genomes to capture the genome diversity present in different cacao populations. In this study, three high-quality chromosome-level genomes of wild cacao were constructed, de novo assembled with HiFi long reads sequencing, and scaffolded using a reference-free strategy. These genomes represent the three most important genetic clusters of cacao trees from the Upper Amazon region. The three wild cacao genomes were compared with two reference genomes of domesticated cacao. The five cacao genetic clusters were inferred to have diverged in the early and middle Pleistocene period, approximately 1.83–0.69 million years ago. The results shown here serve as an example of understanding how the Amazonian biodiversity was developed. The three wild cacao genomes provide valuable resources for studying genetic diversity and advancing genetic improvement of this species.