Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update website section on MS-based proteomics #79

Merged
merged 3 commits into from
Aug 26, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions docs/conf.py
Original file line number Diff line number Diff line change
Expand Up @@ -98,8 +98,8 @@
# html_title = u'Proteomics imputation modelling mass spectrometry (PIMMS)'

html_theme_options = {
"github_url": "https://github.com/RasmussenLab/python_package",
"repository_url": "https://github.com/RasmussenLab/python_package",
"github_url": "https://github.com/RasmussenLab/pimms",
"repository_url": "https://github.com/RasmussenLab/pimms",
"repository_branch": "main",
"home_page_in_toc": True,
"path_to_docs": "docs",
Expand Down
141 changes: 15 additions & 126 deletions docs/proteomics_intro.md
Original file line number Diff line number Diff line change
@@ -1,92 +1,31 @@
# Proteomics Introductions

## Lennart Martens - Introduction to Proteomics
> [lecture videos on youtube](https://www.youtube.com/watch?v=Wy1SwrMzhYk&list=PLXxp6nsBenSX_W8DiOocKJ0laNauYNdYl)
- Digitalizer: Transform an analog signal into a digital signal
There is a recent tutorial by
[Steven Shuken (2023) on massspetrometry-based proteomics](https://pubs.acs.org/doi/10.1021/acs.jproteome.2c00838), I enjoyed reading.

- Ionization Types
- MALDI: Analyte (peptide) is charged by matrix molecules by *one* ion only (mechanism is not so clear)
- three common matrix molecules in proteomics (CHCA, SA, DHB): Benzeme group taking up laser energy.
Inherent bias to three amino-acids by MALDI due to three amino-acids sharing conformational similarity with benzeme group
- Electrospray ionization (ESI): Acid is added to the liquid in order to obtain charged peptides.
- temperature of the needle
- peptides can take more than one ion

- roughly 150.000 possible peptides if 5000 genes are expressed in a tissue (without any PTMs)

- Detectors get worse over time. In the maintence sample one should see a spike when a new
detector is replaced for the old one.

- Fragmentation of peptides
- collision-induced dissociation (CID): b and y ions
- ergodic process, continously stronger vibration of peptide cleaves PSMs
- electron-capture dissociation (ECD): c and z ions
- preserves PTMs as the fracturing is non- ergodic (vibration induced)

## Intro (OpenMS from Tübingen)
> [Recording on youtube](https://www.youtube.com/watch?v=tnS-w8ggEAA&list=PL_6fafgzU1nHd13qqjm3uNvyHh62JpPcB&index=2&t=0s)
- *Ion mobility* as a way to identify number of charges (ions), adding another dimension to the data
- predict MS2 peptide itensities in order to better identify peptides ([MS2PIP](https://iomics.ugent.be/ms2pip/), [DeepMass: Prism](https://github.com/verilylifesciences/deepmass), Prosit)-
- Question: Does this take amino-acid sequences and provides spectra?
- number of mappings from peptides to protein (How many peptides per peptide?)
- absoute quantification siscpa, aqua
- feature-based label-free quantification
- does scale to (100?)
- quantification of isotopes (3D integral: intensity over retention time and m/z )
- [SWATH-MS](https://imsb.ethz.ch/research/aebersold/research/swath-ms.html): DIA in DDA setting?
- [pyOpenMS](https://pyopenms.readthedocs.io/en/latest/)
- HUGO PSI Standards Formats: Machines do not provide all the same standardized dataset.
- KNIME is popular for custom machines. `Nextflow` for cloud providers



## Mass Spectrometry
- Unbiased analysis that does not require prior knowledge of the sample composition
- Analytical technique which identifies molecules based on their mass and charge (m/z)
- Proteomics: the large-scale study of proteins.
Lennart Martens has an introduction to Proteomics [lecture series on youtube](https://www.youtube.com/watch?v=Wy1SwrMzhYk&list=PLXxp6nsBenSX_W8DiOocKJ0laNauYNdYl) from 2016, which I watched to get started.

An introduction I have seen live in 2020 was from the OpenMS team from Tübingen at the EuBICs developer
meeting in Nyborg, where you can find the [recordings on youtube](https://www.youtube.com/watch?v=tnS-w8ggEAA&list=PL_6fafgzU1nHd13qqjm3uNvyHh62JpPcB&index=2&t=0s).

## Pipeline
Graphical overview of the Data Dependent Analysis (DDA) pipeline.

![Proteomics Pipeline](Figures/fig_proteomics_pipeline.png)

### Liquid Chromatorgraphy (LC)
- Peptide separation by hydrophobicity
- hydrophilic vs hydrophobic liquids (Acetonitrile)

### Column
- reverse phase (chromatography)
(also called RPC, reverse-phase chromatography, or hydrophobic chromatography)
includes any chromatographic method that uses a hydrophobic stationary phase.
RPC refers to liquid (rather than gas) chromatography.)
> Reversed-phase chromatography (also called RPC, reverse-phase chromatography, or hydrophobic chromatography) includes any chromatographic method that uses a hydrophobic stationary phase. RPC refers to liquid (rather than gas) chromatography.
> (...) Reversed-phase chromatography is a technique using alkyl chains covalently bonded to the stationary phase particles in order to create a hydrophobic stationary phase, which has a stronger affinity for hydrophobic or less polar compounds. The use of a hydrophobic stationary phase is essentially the reverse of normal phase chromatography, since the polarity of the mobile and stationary phases have been inverted – hence the term reversed-phase chromatography.

- 75um ID packed with 3um/1.9um reverse phase C18 beads. Pulled fused silica
- Column performance is very important for your experiments.
- If the column is not packed perfectly you will have dead volumes and peak tailing.
- You will pick the same peptides for identification

### Mass Specometry
> "One of the most significant differences between transcriptomics and proteomics is in the dynamic range of mRNA and protein concentrations inside the cell. While the protein abundances stretch over at least seven orders of magnitude, from one copy per cell to ten million copies per cell, the mRNA dynamic range covers only three or four orders of magnitude." (https://doi.org/10.1002/pmic.201200451)

Claim: Around 5000 proteins should be identified for each sample.

#### Data Dependent Acquistion (DDA)
Orbitrap specific steps:
1. MS1: mix of peptides to identify most candidates for MS2 scan
2. MS2: one peptide (z/m ratio) which is then fragmented and scanned

> Default: 12 MS2 and 1MS1 scan in parallel
### Orbitrap
Schematic representation of an Orbitrap mass spectrometer.
![Schema Orbitrap](Figures/schema_orbitrap_instrument.jpg)

### Peptide Identification
- How do we get from acquired spectra to protein and peptide identifications?
- some peptides have the same mass
- To identify peptides the mass spectrometer performs a fragment (MS2) scan on an isolated peptide ion
- peptides with the same m/z ratio are fragmented and then analyzed ("de novo" sequencing)
Missing terms in the figure:
Term | meaning
---- | ---
c-Trap | collects one million particles before forwarding them to the Orbitrap
HCD | Higher-energy collisional dissociation (MS2 fragmentation of MS1 ions)


## Amino Acids and residuals
A table for reference.

Name | abr | code | Residue Mass
--- | --- | --- | ---
Expand All @@ -110,54 +49,4 @@ Trypthophan | Trp | W | 186.07931
Tyrosine | Tyr | Y | 163.06333
Valine | Val | V | 99.06841

- Residue mass is referring to the mass in an peptide of a amino acid


## Confunding Factors (or Hyperparameters)
> Critical parameters for DDA methods. Recommendation for machines at CPR

- Max Injection Time
- The maximum time which the instrument will use to reach the target amount of ions in the C-trap
- Low max injection times gives faster scans speed.
- High max injection times gives better intensity and dynamic range

- Automatic Gain Control (AGC) Target
- The target ion amount which will be accumulated in the C- trap
- A higher AGC target will give higher intensity

- Dynamic Exclusion Time
- The time which the instrument will exclude precursors already selected for MS2.
- Exclusion time is dependent on the length of your gradient.
- For 145 minutes, we usually have 30 seconds
- Number of MS2 scans (Top N)
- More MS2 scans gives deeper protein coverage but slower speed.


- sample overloading (too much liquid) messes up scan (relation to _Dynamic Range_?)
- dwell time
- the time a particular ion (m/z) signal is monitored
- cycle time


## Techniques
- TOF
- ORBITRAP
- PASEF
- FAIMS
- HCD Cell

### Orbitrap
![Schema Orbitrap](Figures/schema_orbitrap_instrument.jpg)

## Amino Acid weights
- fragments of peptides are identified on their weights


## Glossar

Term | meaning
---- | ---
c-Trap | Meaning of c? collects one million particles before forwarding them
elute | remove (an adsorbed substance) by washing with a solvent, especially in chromatography
HCD |
XIC | Extracted Ion Current
Loading