Partially sequenced organisms, decoy searches and false discovery rates

Victor, Bjorn; Gabriël, Sarah; Kanobana, Kirezi; Mostovenko, Ekaterina; Polman, Katja; Dorny, Pierre; Deelder, André M.; Palmblad, Magnus

Published in

American Chemical Society, Journal of Proteome Research, 3(11), p. 1991-1995, 2012

DOI: 10.1021/pr201035r

Journal article published in 2012 by Bjorn Victor, Sarah Gabriël, Kirezi Kanobana, Ekaterina Mostovenko, Katja Polman, Pierre Dorny, André M. Deelder, Magnus Palmblad

This paper was not found in any repository; the policy of its publisher is unknown or unclear.

Full text: Unavailable

Preprint: archiving allowed

Must obtain written permission from Editor
Must not violate ACS ethical Guidelines

Postprint: archiving restricted

Must obtain written permission from Editor
Must not violate ACS ethical Guidelines

Published version: archiving forbidden

Abstract

Tandem mass spectrometry is commonly used to identify peptides, typically by comparing their product ion spectra with those predicted from a protein sequence database and scoring these matches. The most reported quality metric for a set of peptide identifications is the false discovery rate (FDR), the fraction of expected false identifications in the set. This metric has so far only been used for completely sequenced organisms or known protein mixtures. We have investigated whether FDR estimations are also applicable in the case of partially sequenced organisms, where many high-quality spectra fail to identify the correct peptides because the latter are not present in the searched sequence database. Using real data from human plasma and simulated partial sequence databases derived from two complete human sequence databases with different levels of redundancy, we could demonstrate that the mixture model approach in PeptideProphet is robust for partial databases, particularly if used in combination with decoy sequences. We therefore recommend using this method when estimating the FDR and reporting peptide identifications from incompletely sequenced organisms.
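As a rough illustration of the decoy idea mentioned in the abstract, the sketch below estimates the FDR at a score threshold by counting decoy hits against target hits. It is the simple target-decoy counting estimate, not the PeptideProphet mixture model that the authors evaluate. It assumes a decoy database of the same size as the target database and a score where higher is better. The function name `decoy_fdr` and the `(score, is_decoy)` tuple layout are hypothetical, chosen only for this example.

```python
from typing import List, Tuple


def decoy_fdr(psms: List[Tuple[float, bool]], threshold: float) -> float:
    """Estimate the FDR among target matches scoring at or above `threshold`.

    `psms` holds (score, is_decoy) pairs, one per peptide-spectrum match.
    Assumption: decoys are as numerous as targets, so each accepted decoy
    hit stands in for roughly one false target hit.
    """
    targets = sum(1 for score, is_decoy in psms if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms if score >= threshold and is_decoy)
    if targets == 0:
        # Nothing accepted, so there is no set to compute a rate over.
        return 0.0
    # Cap at 1.0: a rate above 100% is not meaningful.
    return min(1.0, decoys / targets)


# Made-up scores, for illustration only.
example_psms = [
    (9.1, False), (8.7, False), (8.2, False), (7.9, True), (7.5, False),
    (6.8, False), (6.1, True), (5.9, False), (5.2, True), (4.8, True),
]

for cutoff in (8.0, 7.0, 5.0):
    print(cutoff, decoy_fdr(example_psms, cutoff))
```

With a partial database, many good spectra cannot match their true peptide, so both the target and decoy counts change. This is the situation in which the abstract reports that the mixture model, combined with decoys, remains robust.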
