Abstract
Most proteomics experiments make use of 'high throughput' technologies such a's 2-DE, MS or protein arrays to measure simultaneously the expression levels of thousands of proteins. Such experiments yield, large, high-dimensional data sets which usually reflect not only the biological but also technical and experimental factors. Statistical tools are essential for evaluating these data and preventing false conclusions. Here, an overview is given of some, typical statistical tools for proteomics experiments.
In particular, we present methods for data preprocessing (e.g. calibration, missing values estimation and outlier detection), comparison of protein expression in different groups (e.g. detection of differentially expressed proteins or classification of new observations) as well as the detection of dependencies between proteins (e.g. protein clusters or networks). We also discuss questions of sample size planning for some of these methods.
| Original language | English |
|---|---|
| Pages (from-to) | 48-55 |
| Number of pages | 8 |
| Journal | Proteomics |
| Volume | 6 |
| Issue number | S2 |
| DOIs | |
| Publication status | Published - Sept-2006 |
| Externally published | Yes |
Keywords
- DIFFERENCE GEL-ELECTROPHORESIS
- MASS-SPECTROMETRY
- MICROARRAY EXPERIMENTS
- CLASSIFICATION
- SERUM