Comparing a protein’s concentrations across two or more treatments
is the focus of many proteomics studies. A frequent source
of measurements for these comparisons is a mass spectrometry (MS)
analysis of a protein’s peptide ions separated by liquid chromatography
(LC) following its enzymatic digestion. Alas, LC-MS identification
and quantification of equimolar peptides can vary significantly due to
their unequal digestion, separation and ionization. This unequal measurability
of peptides, the largest source of LC-MS nuisance variation,
stymies confident comparison of a protein’s concentration across treatments.
Our objective is to introduce a mixed-effects statistical model for
comparative LC-MS proteomics studies. We describe LC-MS peptide
abundance with a linear model featuring pivotal terms that account for
unequal peptide LC-MS measurability. We advance fitting this model
to an often incomplete LC-MS dataset with REstricted Maximum Likelihood
(REML) estimation, producing estimates of model goodness-offit,
treatment effects, standard errors, confidence intervals, and protein
relative concentrations. We illustrate the model with an experiment
featuring a known dilution series of a filamentous ascomycete fungus
Trichoderma reesei protein mixture.
For the 781 of 1546 T.reesei proteins with sufficient data coverage,
the fitted mixed-effects models capably described the LC-MS measurements.
The LC-MS measurability terms effectively accounted for this
major source of uncertainty. Ninety percent of the relative concentration
estimates were within 1/2 fold of the true relative concentrations.
Akin to the common ratio method, this model also produced biased
estimates, albeit less biased. Bias decreased significantly, both absolutely
and relative to the ratio method, as the number of observed
peptides per protein increased.
Mixed-effects statistical modeling offers a flexible, well-established
methodology for comparative proteomics studies integrating common
experimental designs with LC-MS sample processing plans. It favorably
accounts for the unequal LC-MS measurability of peptides and
produces informative quantitative comparisons of a protein’s concentration
across treatments with objective measures of uncertainties.
Revised: August 11, 2011 |
Published: March 1, 2008
Citation
Daly D.S., K.K. Anderson, E.A. Panisko, S.O. Purvine, R. Fang, M.E. Monroe, and S.E. Baker. 2008.A mixed-effects Statistical Model for Comparative LC-MS Proteomics Studies.Journal of Proteome Research 7, no. 3:1209-1217.PNNL-SA-57824.doi:10.1021/pr070441i