Skip to content

Use/example for MS:4000156 (area under TIC RT quantiles) as quality criteria #310

@jorainer

Description

@jorainer

Picking up on a small discussion/question we had at the mzQC session in Tuebingen. We (in xcms-based LC-MS data preprocessing) generally create TICs and base peak chromatograms (BPC) to evaluate how similar the samples, respectively their chromatographic signals, are (within the same experiment). We use this to spot potential problematic samples/runs and ensure that e.g. QC samples are similar etc. Also, we use this to evaluate the performance of the retention time alignment step.

While this can be visually evaluated, we obviously also want to have a quantitative assessment. To do that we used to bin the BPCs (in bins of e.g. 2 seconds, or 5 seconds) and then used heatmaps and hierarchical clustering or correlation matrices to get a numeric estimate. After digging through the various mzQC measures I stumbled over the area under TIC RT quantiles measure - and we now use this, i.e. instead of binning the data in x second bins we bin the signal by e.g. 10 or 20 quantiles.

As an example: below are BPC of a small experiment, with one of the samples showing a retention time shift.

Image

A heatmap of area under BPC RT quantiles for that data is shown below. The samples/runs are along the y-axis and the retention time bins (19 quantiles) along the x-axis. The data is not bad, but the sample replicates do not group as expected.

Image

The same plots after retention time alignment:

Image

the shift was properly adjusted, and also the runs cluster as expected based on the area under BPC RT quantiles:

Image

We include this quality check now regularly in our analyses and will also add an example to our Metabonaut resource. My questions now would be:

  1. would it be possible to create a new "area under BPC RT quantiles" mzQC term?

  2. would this be something interesting to report in your paper (or supplement)? As an example/use case for this mzQC term? If yes, we would need to look for other (publicly available) input data to provide a completely reproducible example.

Happy to discuss :)

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions