MGnify: the microbiome sequence data analysis resource in 2023

Richardson, Lorna; Allen, Ben; Baldi, Germana; Beracochea, Martin; Bileschi, Maxwell L.; Burdett, Tony; Burgin, Josephine; Caballero-Pérez, Juan; Cochrane, Guy; Colwell, Lucy J.; Curtis, Tom; Escobar-Zepeda, Alejandra; Gurbich, Tatiana A.; Kale, Varsha; Korobeynikov, Anton; Raj, Shriya; Rogers, Alexander B.; Sakharova, Ekaterina; Sanchez, Santiago; Wilkinson, Darren J.; Finn, Robert D.

Published in

Oxford University Press, Nucleic Acids Research, D1(51), p. D753-D759, 2022

DOI: 10.1093/nar/gkac1080

Tools

Export citation

Search in Google Scholar

MGnify: the microbiome sequence data analysis resource in 2023

Journal article published in 2022 by Lorna Richardson

, Ben Allen, Germana Baldi, Martin Beracochea, Maxwell L. Bileschi, Tony Burdett

, Josephine Burgin

, Juan Caballero-Pérez

, Guy Cochrane

, Lucy J. Colwell, Tom Curtis, Alejandra Escobar-Zepeda, Tatiana A. Gurbich

, Varsha Kale, Anton Korobeynikov and other authors.

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Abstract The MGnify platform (https://www.ebi.ac.uk/metagenomics) facilitates the assembly, analysis and archiving of microbiome-derived nucleic acid sequences. The platform provides access to taxonomic assignments and functional annotations for nearly half a million analyses covering metabarcoding, metatranscriptomic, and metagenomic datasets, which are derived from a wide range of different environments. Over the past 3 years, MGnify has not only grown in terms of the number of datasets contained but also increased the breadth of analyses provided, such as the analysis of long-read sequences. The MGnify protein database now exceeds 2.4 billion non-redundant sequences predicted from metagenomic assemblies. This collection is now organised into a relational database making it possible to understand the genomic context of the protein through navigation back to the source assembly and sample metadata, marking a major improvement. To extend beyond the functional annotations already provided in MGnify, we have applied deep learning-based annotation methods. The technology underlying MGnify's Application Programming Interface (API) and website has been upgraded, and we have enabled the ability to perform downstream analysis of the MGnify data through the introduction of a coupled Jupyter Lab environment.

Published in

Links

Tools

MGnify: the microbiome sequence data analysis resource in 2023

Abstract