A proteomics sample metadata representation for multiomics integration and big data analysis

Dai, Chengxin; Füllgrabe, Anja; Pfeuffer, Julianus; Solovyeva, Elizaveta M.; Deng, Jingwen; Moreno, Pablo; Kamatchinathan, Selvakumar; Kundu, Deepti Jaiswal; George, Nancy; Fexova, Silvie; Grüning, Björn; Föll, Melanie Christine; Griss, Johannes; Vaudel, Marc; Audain, Enrique; Locard-Paulet, Marie; Turewicz, Michael; Eisenacher, Martin; Uszkoreit, Julian; Van Den Bossche, Tim; Schwämmle, Veit; Webel, Henry; Schulze, Stefan; Bouyssié, David; Jayaram, Savita; Duggineni, Vinay Kumar; Samaras, Patroklos; Wilhelm, Mathias; Choi, Meena; Wang, Mingxun; Kohlbacher, Oliver; Brazma, Alvis; Papatheodorou, Irene; Bandeira, Nuno; Deutsch, Eric W.; Vizcaíno, Juan Antonio; Bai, Mingze; Sachsenberg, Timo; Levitsky, Lev I.; Perez-Riverol, Yasset

Published in

Nature Research, Nature Communications, 1(12), 2021

DOI: 10.1038/s41467-021-26111-3

Tools

Export citation

Search in Google Scholar

A proteomics sample metadata representation for multiomics integration and big data analysis

Journal article published in 2021 by Chengxin Dai, Anja Füllgrabe

, Julianus Pfeuffer, Elizaveta M. Solovyeva

, Jingwen Deng, Pablo Moreno

, Selvakumar Kamatchinathan, Deepti Jaiswal Kundu, Nancy George

, Silvie Fexova, Björn Grüning

, Melanie Christine Föll, Johannes Griss

, Marc Vaudel

, Enrique Audain

and other authors.

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

AbstractThe amount of public proteomics data is rapidly increasing but there is no standardized format to describe the sample metadata and their relationship with the dataset files in a way that fully supports their understanding or reanalysis. Here we propose to develop the transcriptomics data format MAGE-TAB into a standard representation for proteomics sample metadata. We implement MAGE-TAB-Proteomics in a crowdsourcing project to manually curate over 200 public datasets. We also describe tools and libraries to validate and submit sample metadata-related information to the PRIDE repository. We expect that these developments will improve the reproducibility and facilitate the reanalysis and integration of public proteomics datasets.

Published in

Links

Tools

A proteomics sample metadata representation for multiomics integration and big data analysis

Abstract