April 19, 2012
Journal Article

A Hybrid Approach to Protein Differential Expression in Mass Spectrometry-Based Proteomics

Abstract

Motivation: Quantitative mass spectrometry-based proteomics involves statistical inference on protein abundance, based on the intensities of each protein’s associated spectral peaks. However, typical MS-based proteomics data sets have substantial proportions of missing observations, due at least in part to censoring of low intensities. This complicates intensity-based differential expression analysis. Results: We outline a statistical method for protein differential expression, based on a simple Binomial likelihood. By modeling peak intensities as binary, in terms of "presence / absence," we enable the selection of proteins not typically amendable to quantitative analysis; e.g., "one-state" proteins that are present in one condition but absent in another. In addition, we present an analysis protocol that combines quantitative and presence / absence analysis of a given data set in a principled way, resulting in a single list of selected proteins with a single associated FDR.

Revised: June 18, 2012 | Published: April 19, 2012

Citation

Wang X., G.A. Anderson, R.D. Smith, and A.R. Dabney. 2012. A Hybrid Approach to Protein Differential Expression in Mass Spectrometry-Based Proteomics. Bioinformatics 28, no. 12:1586-1591. PNNL-SA-84117. doi:10.1093/bioinformatics/bts193