Optimized time alignment algorithm for LC-MS data: Correlation optimized warping using component detection algorithm-selected mass chromatograms

  • Christin Christin
  • , Age K. Smilde
  • , Huub C. J. Hoefsloot
  • , Frank Suits
  • , Rainer Bischoff
  • , Peter L. Horvatovich*
  • *Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

75 Citations (Scopus)

Abstract

Correlation optimized warping (COW) based on the total ion current (TIC) is a widely used time alignment algorithm (COW-TIC). This approach works successfully on chromatograms containing few compounds and having a well-defined TIC. In this paper, we have combined COW with a component detection algorithm (CODA) to align LC-MS chromatograms containing thousands of biological compounds with overlapping chromatographic peaks, a situation where COW-TIC often fails. CODA is a variable selection procedure that selects mass chromatograms with low noise and low background (so-called "high-quality" mass chromatograms). High-quality mass chromatograms selected in each COW segment ensure that the same compounds (based on their mass and their retention time) are used in the two-dimensional benefit function of COW to obtain correct and optimal alignments (COW-CODA). The performance of the COW-CODA algorithm was evaluated on three types of complex data sets obtained from the LC-MS analysis of samples commonly used for biomarker discovery and compared to COW-TIC using a new global comparison method based on overlapping peak area: trypsin-digested serum obtained from cervical cancer patients, trypsin-digested serum from a single patient that was treated with varying preanalytical parameters (factorial design study), and urine from pregnant and nonpregnant women. While COW-CODA did result in minor misalignments in rare cases, it was clearly superior to the COW-TIC algorithm, especially when applied to highly variable chromatograms (factorial design, urine). The presented algorithm thus enables automatic time alignment and accurate peak matching of multiple LC-MS data sets obtained from complex body fluids that are often used for biomarker discovery.

Original languageEnglish
Pages (from-to)7012-7021
Number of pages10
JournalAnalytical Chemistry
Volume80
Issue number18
DOIs
Publication statusPublished - 15-Sept-2008

Keywords

  • RETENTION TIME
  • HUMAN SERUM
  • CHEMOMETRIC ANALYSIS
  • BIOMARKER DISCOVERY
  • DATA SETS
  • SPECTROMETRY
  • PROTEOMICS
  • PROFILES
  • QUANTIFICATION
  • METABOLITES

Fingerprint

Dive into the research topics of 'Optimized time alignment algorithm for LC-MS data: Correlation optimized warping using component detection algorithm-selected mass chromatograms'. Together they form a unique fingerprint.

Cite this