Published in

Oxford University Press, Bioinformatics, 33(23), p. 3767-3775, 2017

DOI: 10.1093/bioinformatics/btx458

proFIA: a data preprocessing workflow for flow injection analysis coupled to high-resolution mass spectrometry

This paper is made freely available by the publisher.

Abstract

Motivation: Flow Injection Analysis coupled to High-Resolution Mass Spectrometry (FIA-HRMS) is a promising approach for high-throughput metabolomics. FIA-HRMS data, however, cannot be preprocessed with current software tools which rely on liquid chromatography separation, or handle low resolution data only.

Results: We thus developed the proFIA package, which implements a suite of innovative algorithms to preprocess FIA-HRMS raw files, and generates the table of peak intensities. The workflow consists of 3 steps: (i) noise estimation, peak detection and quantification, (ii) peak grouping across samples and (iii) missing value imputation. In addition, we have implemented a new indicator to quantify the potential alteration of the feature peak shape due to matrix effect. The preprocessing is fast (less than 15 s per file), and the value of the main parameters (ppm and dmz) can be easily inferred from the mass resolution of the instrument. Application to two metabolomics datasets (including spiked serum samples) showed high precision (96%) and recall (98%) compared with manual integration. These results demonstrate that proFIA achieves very efficient and robust detection and quantification of FIA-HRMS data, and opens new opportunities for high-throughput phenotyping.

Availability and implementation: The proFIA software (as well as the plasFIA dataset) is available as an R package on the Bioconductor repository (http://bioconductor.org/packages/proFIA), and as a Galaxy module on the Main Toolshed (https://toolshed.g2.bx.psu.edu), and on the Workflow4Metabolomics online infrastructure (http://workflow4metabolomics.org).

Supplementary information: Supplementary data are available at Bioinformatics online.
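
To give a rough feel for how the ppm and dmz parameters named in the abstract could interact during peak grouping across samples (step ii), here is a minimal Python sketch. It is not proFIA's algorithm or API. It assumes that the m/z tolerance is the ppm-relative width floored at dmz, and it replaces the package's grouping method with a simple greedy pass over sorted m/z values. The function names `mz_tolerance` and `group_peaks` and the example peaks are hypothetical.

```python
def mz_tolerance(mz, ppm, dmz):
    """Absolute m/z tolerance: ppm-relative width, floored at dmz.

    Assumption for illustration: dmz acts as a minimum tolerance so that
    low masses, where the ppm window becomes very narrow, are still matched.
    """
    return max(mz * ppm * 1e-6, dmz)


def group_peaks(peaks, ppm, dmz):
    """Greedy grouping of (sample, mz) peaks across samples.

    Peaks are visited in increasing m/z order. A peak joins the current
    group if it lies within the tolerance of the group's running mean m/z;
    otherwise it opens a new group. This is a simplified stand-in, not the
    grouping method implemented in proFIA.
    """
    groups = []
    for sample, mz in sorted(peaks, key=lambda p: p[1]):
        if groups:
            current = groups[-1]
            mean_mz = sum(m for _, m in current) / len(current)
            if abs(mz - mean_mz) <= mz_tolerance(mean_mz, ppm, dmz):
                current.append((sample, mz))
                continue
        groups.append([(sample, mz)])
    return groups


# Hypothetical peaks detected in three samples (sample id, m/z).
peaks = [
    ("s1", 180.0634), ("s2", 180.0636), ("s3", 180.0633),
    ("s1", 181.0710), ("s3", 181.0712),
    ("s2", 90.0550),
]
groups = group_peaks(peaks, ppm=2.0, dmz=0.0005)
for g in groups:
    print([sample for sample, _ in g], [mz for _, mz in g])
```

With these illustrative values the six peaks fall into three groups. At m/z 180 a 2 ppm window is only about 0.00036 wide, so under the stated assumption the dmz floor of 0.0005 is what sets the tolerance.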