4.7 Article

Postnovo: Postprocessing Enables Accurate and FDR-Controlled de Novo Peptide Sequencing

Journal

JOURNAL OF PROTEOME RESEARCH
Volume 17, Issue 11, Pages 3671-3680

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.8b00278

Keywords

de novo peptide sequencing; mass spectrometry; false discovery rate

Funding

  1. Gordon and Betty Moore Foundation [3305, 3306]
  2. Simons Foundation [402971]

Ask authors/readers for more resources

De novo sequencing offers an alternative to database search methods for peptide identification from mass spectra. Since it does not rely on a predetermined database of expected or potential sequences in the sample, de novo sequencing is particularly appropriate for samples lacking a well-defined or comprehensive reference database. However, the low accuracy of many de novo sequence predictions has prevented the widespread use of the variety of sequencing tools currently available. Here, we present a new open-source tool, Postnovo, that postprocesses de novo sequence predictions to find high-accuracy results. Postnovo uses a predictive model to rescore and rerank candidate sequences in a manner akin to database search postprocessing tools such as Percolator. Postnovo leverages the output from multiple de novo sequencing tools in its own analyses, producing many times the length of amino acid sequence information (including both full- and partial-length peptide sequences) at an equivalent false discovery rate (FDR) compared to any individual tool. We present a methodology to reliably screen the sequence predictions to a desired FDR given the Postnovo sequence score. We validate Postnovo with multiple data sets and demonstrate its ability to identify proteins that are missed by database search even in samples with paired reference databases.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available