Strategies to enable large-scale proteomics for reproducible research

Poulos, Rebecca C.; Hains, Peter G.; Shah, Rohan; Lucas, Natasha; Xavier, Dylan; Manda, Srikanth S.; Anees, Asim; Koh, Jennifer M. S.; Mahboob, Sadia; Wittman, Max; Williams, Steven G.; Sykes, Erin K.; Hecker, Michael; Dausmann, Michael; Wouters, Merridee A.; Ashman, Keith; Yang, Jean; Wild, Peter J.; deFazio, Anna; Balleine, Rosemary L.; Tully, Brett; Aebersold, Ruedi; Speed, Terence P.; Liu, Yansheng; Reddel, Roger R.; Robinson, Phillip J.; Zhong, Qing

Published in

Nature Research, Nature Communications, 11(1), 2020

DOI: 10.1038/s41467-020-17641-3



This paper is made freely available by the publisher.


Abstract

Reproducible research is the bedrock of experimental science. To enable the deployment of large-scale proteomics, we assess the reproducibility of mass spectrometry (MS) over time and across instruments and develop computational methods for improving quantitative accuracy. We perform 1560 data independent acquisition (DIA)-MS runs of eight samples containing known proportions of ovarian and prostate cancer tissue and yeast, or control HEK293T cells. Replicates are run on six mass spectrometers operating continuously with varying maintenance schedules over four months, interspersed with ~5000 other runs. We utilise negative controls and replicates to remove unwanted variation and enhance biological signal, outperforming existing methods. We also design a method for reducing missing values. Integrating these computational modules into a pipeline (ProNorM), we mitigate variation among instruments over time and accurately predict tissue proportions. We demonstrate how to improve the quantitative analysis of large-scale DIA-MS data, providing a pathway toward clinical proteomics.
