Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

Fröhlich, Klemens; Brombacher, Eva; Fahrner, Matthias; Vogele, Daniel; Kook, Lucas; Pinter, Niko; Bronsert, Peter; Timme-Bronsert, Sylvia; Schmidt, Alexander; Bärenfaller, Katja; Kreutz, Clemens; Schilling, Oliver

Published in

Nature Research, Nature Communications, 1(13), 2022

DOI: 10.1038/s41467-022-30094-0

Tools

Export citation

Search in Google Scholar

Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

Journal article published in 2022 by Klemens Fröhlich

, Eva Brombacher

, Matthias Fahrner

, Daniel Vogele

, Lucas Kook, Niko Pinter, Peter Bronsert

, Sylvia Timme-Bronsert, Alexander Schmidt

, Katja Bärenfaller

, Clemens Kreutz

, Oliver Schilling

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

AbstractNumerous software tools exist for data-independent acquisition (DIA) analysis of clinical samples, necessitating their comprehensive benchmarking. We present a benchmark dataset comprising real-world inter-patient heterogeneity, which we use for in-depth benchmarking of DIA data analysis workflows for clinical settings. Combining spectral libraries, DIA software, sparsity reduction, normalization, and statistical tests results in 1428 distinct data analysis workflows, which we evaluate based on their ability to correctly identify differentially abundant proteins. From our dataset, we derive bootstrap datasets of varying sample sizes and use the whole range of bootstrap datasets to robustly evaluate each workflow. We find that all DIA software suites benefit from using a gas-phase fractionated spectral library, irrespective of the library refinement used. Gas-phase fractionation-based libraries perform best against two out of three reference protein lists. Among all investigated statistical tests non-parametric permutation-based statistical tests consistently perform best.

Published in

Links

Tools

Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

Abstract