TY - JOUR
T1 - Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics
AU - Prakash, Amol
AU - Piening, Brian
AU - Whiteaker, Jeff
AU - Zhang, Heidi
AU - Shaffer, Scott A.
AU - Martin, Daniel
AU - Hohmann, Laura
AU - Cooke, Kelly
AU - Olson, James M.
AU - Hansen, Stacey
AU - Flory, Mark R.
AU - Lee, Hookeun
AU - Watts, Julian
AU - Goodlett, David R.
AU - Aebersold, Ruedi
AU - Paulovich, Amanda
AU - Schwikowski, Benno
PY - 2007/10
Y1 - 2007/10
N2 - Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sample collection, preparation, and storage protocols and LC-MS settings and conditions. On the basis of a formally precise and quantitative definition of similarity between LC-MS experiments, we have developed Cha-order, a fully automatic software tool that can assess experimental reproducibility of sets of large scale LC-MS experiments. By visualizing the similarity relationships within a set of experiments, this tool can form the basis of systematic quality control and thus help assess the comparability of mass spectrometry data over time, across different laboratories, and between instruments. Applying Chaorder to data from multiple laboratories and a range of instruments, experimental protocols, and sample complexities revealed biases introduced by the sample processing steps, experimental protocols, and instrument choices. Moreover we show that reducing bias by correcting for just a few steps, for example randomizing the run order, does not provide much gain in statistical power for biomarker discovery.
AB - Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sample collection, preparation, and storage protocols and LC-MS settings and conditions. On the basis of a formally precise and quantitative definition of similarity between LC-MS experiments, we have developed Cha-order, a fully automatic software tool that can assess experimental reproducibility of sets of large scale LC-MS experiments. By visualizing the similarity relationships within a set of experiments, this tool can form the basis of systematic quality control and thus help assess the comparability of mass spectrometry data over time, across different laboratories, and between instruments. Applying Chaorder to data from multiple laboratories and a range of instruments, experimental protocols, and sample complexities revealed biases introduced by the sample processing steps, experimental protocols, and instrument choices. Moreover we show that reducing bias by correcting for just a few steps, for example randomizing the run order, does not provide much gain in statistical power for biomarker discovery.
UR - http://www.scopus.com/inward/record.url?scp=35648980489&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=35648980489&partnerID=8YFLogxK
U2 - 10.1074/mcp.M600470-MCP200
DO - 10.1074/mcp.M600470-MCP200
M3 - Article
C2 - 17617667
AN - SCOPUS:35648980489
SN - 1535-9476
VL - 6
SP - 1741
EP - 1748
JO - Molecular and Cellular Proteomics
JF - Molecular and Cellular Proteomics
IS - 10
ER -