September 8, 2016
Journal Article

Multi-species identification of polymorphic peptide variants via propagation in spectral networks

Abstract

The spectral networks approach enables the detection of pairs of spectra from related peptides and thus allows for the propagation of annotations from identified peptides to unidentified spectra. Beyond allowing for unbiased discovery of unexpected post-translational modifications, spectral networks are also applicable to multi-species comparative proteomics or metaproteomics to identify numerous orthologous versions of a protein. We present algorithmic and statistical advances in spectral networks that have made it possible to rigorously assess the statistical significance of spectral pairs and accurately estimate the error rate of identifications via propagation. In the analysis of three related Cyanothece species, a model organism for biohydrogen production, spectral networks identified peptides with highly divergent sequences with up to dozens of variants per peptide, including many novel peptides in species that lack a sequenced genome. Furthermore, spectral networks strongly suggested the presence of novel peptides even in genomically characterized species (i.e. missing from databases) in that a significant portion of unidentified multi-species networks included at least two polymorphic peptide variants.

Revised: April 4, 2017 | Published: September 8, 2016

Citation

Na S., S.H. Payne, and N. Bandeira. 2016. Multi-species identification of polymorphic peptide variants via propagation in spectral networks. Molecular & Cellular Proteomics 15, no. 11:3501-3512. PNNL-SA-110651. doi:10.1074/mcp.O116.060913