diff --git a/_site/omics/week-3/images/88H-exp-design-jillian.png b/_site/omics/week-3/images/88H-exp-design-jillian.png new file mode 100644 index 0000000..e46a2ba Binary files /dev/null and b/_site/omics/week-3/images/88H-exp-design-jillian.png differ diff --git a/_site/omics/week-3/study_before_workshop.html b/_site/omics/week-3/study_before_workshop.html index 6b4a1ac..bc51bab 100644 --- a/_site/omics/week-3/study_before_workshop.html +++ b/_site/omics/week-3/study_before_workshop.html @@ -1,898 +1,887 @@ - - - - - - - - - -Data Analysis for Group Project - Independent Study to prepare for workshop - - - - - + - - - - - + - - - - - - - - - - + + + - + + Data Analysis for Group Project - Independent Study to prepare for workshop + + + + + + + + + + + + + + + + + +
+
+ +
+

Independent Study to prepare for workshop

+

Omics 1: Hello data!

+ +
+
+
+Emma Rand +
+
+
+ +

6 October, 2023

-
-

Sequence Data

-
    -
  • reads
  • -
  • quality control
  • -
  • align/pseudoalign
  • -
  • quantify
  • -
  • normalise
  • -
+
+

🚧 NB still in construction 🚧

-
-

Aims

+
+

Overview

+
    -
  • find important genes
  • -
  • what is ‘important’ -
      -
    • expressed
    • -
    • differentially expressed i.e., which genes are significantly higher in one group vs another
    • -
  • +
  • Concise summary of the experimental design and aims

  • +
  • What the raw data consist of

  • +
  • What has been done to the data so far

  • +
  • What steps we will take in the workshop

+
-
-

What is a read

+
+

The Data

+

There are three datasets

    -
  • FASTQ format files -
      -
    • sequences and information about each sequence’s read accuracy
    • -
  • +
  • 🐸 transcriptomic data (bulk RNA-seq) from frog embryos.

  • +
  • 🐭 transcriptomic data (single cell RNA-seq) from stemcells

  • +
  • 🍂 ??????? Metabolomic / Metagenomic data from anaerobic digesters

-

-
-

Differential expression

-
    -
  • what is differentially expressed
  • -
  • how are DE genes related
  • -
  • what are DE genes involved with
  • -
+
+
+

Experimental design

+
-
-

Stem cells: background

-

Cells were sorted using flow cytometry on the basis of cell surface markers. There are three cell types:

+
+

🐸 Experimental design

+ +

Schematic of frog development experiment

+
+

🐸 Experimental design

+ +

Schematic of frog development experiment

    -
  • long-term haematopoetic stem cells (LT-HSCs) defined as : Lineage- ckit+ Sca1+ CD34- Flk2-

  • -
  • haematopoetic stem and progenitor cells (HSPCs) defined as : Lineage- Sca1+ ckit+

  • -
  • progenitor cells (Progs) defined as : Lineage- Sca1- ckit+

  • +
  • 3 fertilisations

  • +
  • two siblings from each fertilisation one control, on FGF treated

  • +
  • sequenced at three time points

  • +
  • 3 x 2 x 3 = 18 groups

-

Each cell is then sequenced to quantify all the transcripts in each cell. different transcripts (genes) were identified.

+
-
-

Stem cells: background

+
+

🐸 Experimental design

+ +

Schematic of frog development experiment

+
-
-

Stem cells: Processing so far

+
+

🐸 Aim

+
    -
  • selection of cells
  • -
  • selection of genes: subset of surfaceome
  • -
  • log2 normalised values
  • +
  • find genes important in frog development

  • +
  • Important means genes that are differentially expressed between the control and the FGF treated sibling

  • +
  • Differentially expressed means the expression on one group is signifcantly higher than the other

+
-
-

Stem cells: Aims

+
+

🐸 Guided analysis

+
    -
  • Find interesting cell surface molecule genes that vary between cell types.
  • +
  • The workshops will take you through comparing the control and FGF treated sibling at S30

  • +
  • This is the “least interesting” comparison

  • +
  • You will be guided to carefully document your work so you can apply the same methods to other comparisons

+
-
-

Frog development: background

+
+

🐭 Experimental design

+ +

Schematic of stem cell experiment

+
+

🐭 Experimental design

+ +

Schematic of stem cell experiment

    -
  • 3 fertilisations = replicates
  • -
  • two siblings from each fertilisation one control, on FGF treated
  • -
  • sequenced at three time points: S14, S20, S30
  • -
  • 3 x 2 x 3 = 18 groups
  • +
  • Cells were sorted using flow cytometry on the basis of cell surface markers

  • +
  • There are three cell types: LT-HSCs, HSPCs, Progs

  • +
  • Many cells of each cell type were sequenced

-

+
-
-

Frog development: background

+
+

🐭 Experimental design

+ +

Schematic of stem cell experiment

+
-
-

Frog development: Processing so far

+
+

🐭 Aim

+
    -
  • selection of cells
  • -
  • selection of genes: subset of surfaceome
  • -
  • log2 normalised values
  • +
  • find genes for cell surface proteins that are important in stem cell identity

  • +
  • Important means genes that are differentially expressed between at least two cell types

  • +
  • Differentially expressed means the expression on one group is significantly higher than the other

+
-
-

Frog development: Aims

+
+

🐭 Guided analysis

+
    -
  • Find interesting cell surface molecule genes that vary between cell types.
  • +
  • The workshops will take you through comparing the HSPC and Prog cells

  • +
  • This is the “least interesting” comparison

  • +
  • You will be guided to carefully document your work so you can apply the same methods to other comparisons

-
-
-

-
-
-

Deliverables

-
    -
  1. Describe the data

    -
      -
    • Number of cells/samples/reps/treatments
    • -
    • number of genes
    • -
    • type of expression values
    • -
    • prior processing
    • -
    • missing values
    • -
    • overview of expression
    • -
    • clustering of genes/samples
    • -
  2. -
  3. Report on differential expression between two groups

    -
      -
    • number of DE at 1%, 5% and 10%.
    • -
    • table of expression, fold changes, signifcance at each sig.
    • -
    • Volcano plot
    • -
  4. -
  5. Report list of marker candidate gene IDs for a cell type of choice. Justify filters. Table with fold FC, p values, IDs, canonical gene names

  6. -
  7. Interpret the biology by reporting on a few group of genes and the processes in which they are involved.

  8. -
  9. Report on your chosen genes and explain why you think they are good candidates for follow up work

  10. -
- +
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + - + + + + + + + + -

References

-
-Nestorowa, Sonia, Fiona K. Hamey, Blanca Pijuan Sala, Evangelia Diamanti, Mairi Shepherd, Elisa Laurenti, Nicola K. Wilson, David G. Kent, and Berthold Göttgens. 2016. “A Single-Cell Resolution Map of Mouse Hematopoietic Stem and Progenitor Cell Differentiation.” Blood 128 (8): e20–31. https://doi.org/10.1182/blood-2016-05-716480. + -
- + + + + + + + + + + + + + + + + + + -
- - - - + }); + + \ No newline at end of file diff --git a/_site/search.json b/_site/search.json index 54d6c98..68882ea 100644 --- a/_site/search.json +++ b/_site/search.json @@ -399,121 +399,77 @@ "href": "omics/week-3/study_before_workshop.html#what-is-a-read", "title": "Independent Study to prepare for workshop", "section": "What is a read", - "text": "What is a read\n\nFASTQ format files\n\nsequences and information about each sequence’s read accuracy", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "What is a read\n\nFASTQ format files\n\nsequences and information about each sequence’s read accuracy" }, { "objectID": "omics/week-3/study_before_workshop.html#differential-expression", "href": "omics/week-3/study_before_workshop.html#differential-expression", "title": "Independent Study to prepare for workshop", "section": "Differential expression", - "text": "Differential expression\n\nwhat is differentially expressed\nhow are DE genes related\nwhat are DE genes involved with", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Differential expression\n\nwhat is differentially expressed\nhow are DE genes related\nwhat are DE genes involved with" }, { "objectID": "omics/week-3/study_before_workshop.html#stem-cells-background", "href": "omics/week-3/study_before_workshop.html#stem-cells-background", "title": "Independent Study to prepare for workshop", "section": "Stem cells: background", - "text": "Stem cells: background\nCells were sorted using flow cytometry on the basis of cell surface markers. There are three cell types:\n\nlong-term haematopoetic stem cells (LT-HSCs) defined as : Lineage- ckit+ Sca1+ CD34- Flk2-\nhaematopoetic stem and progenitor cells (HSPCs) defined as : Lineage- Sca1+ ckit+\nprogenitor cells (Progs) defined as : Lineage- Sca1- ckit+\n\nEach cell is then sequenced to quantify all the transcripts in each cell. different transcripts (genes) were identified.", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Stem cells: background\nCells were sorted using flow cytometry on the basis of cell surface markers. There are three cell types:\n\nlong-term haematopoetic stem cells (LT-HSCs)\nhaematopoetic stem and progenitor cells (HSPCs)\nprogenitor cells (Progs)\n\nEach cell is then sequenced to quantify all the transcripts in each cell. different transcripts (genes) were identified." }, { "objectID": "omics/week-3/study_before_workshop.html#stem-cells-background-1", "href": "omics/week-3/study_before_workshop.html#stem-cells-background-1", "title": "Independent Study to prepare for workshop", "section": "Stem cells: background", - "text": "Stem cells: background\n\nRaw data: GEO Series GSE81682\nIllumina HiSeq\nshort reads 150-300bp\nA single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation (Nestorowa et al. 2016)\n3,840 samples\nReads were aligned using G-SNAP and the mapped reads were assigned to Ensembl genes HTSeq\nGSE81682_HTSeq_counts.txt.gz (bottom of the page). And GSE81682_HTSeq_counts.txt.zip", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Stem cells: background\n\nRaw data: GEO Series GSE81682\nIllumina HiSeq\nshort reads 150-300bp\nA single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation (Nestorowa et al. 2016)\n3,840 samples\nReads were aligned using G-SNAP and the mapped reads were assigned to Ensembl genes HTSeq\nGSE81682_HTSeq_counts.txt.gz (bottom of the page). And GSE81682_HTSeq_counts.txt.zip" }, { "objectID": "omics/week-3/study_before_workshop.html#stem-cells-processing-so-far", "href": "omics/week-3/study_before_workshop.html#stem-cells-processing-so-far", "title": "Independent Study to prepare for workshop", "section": "Stem cells: Processing so far", - "text": "Stem cells: Processing so far\n\nselection of cells\nselection of genes: subset of surfaceome\nlog2 normalised values", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Stem cells: Processing so far\n\nselection of cells\nselection of genes: subset of surfaceome\nlog2 normalised values" }, { "objectID": "omics/week-3/study_before_workshop.html#stem-cells-aims", "href": "omics/week-3/study_before_workshop.html#stem-cells-aims", "title": "Independent Study to prepare for workshop", "section": "Stem cells: Aims", - "text": "Stem cells: Aims\n\nFind interesting cell surface molecule genes that vary between cell types.", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Stem cells: Aims\n\nFind interesting cell surface molecule genes that vary between cell types." }, { "objectID": "omics/week-3/study_before_workshop.html#frog-development-background", "href": "omics/week-3/study_before_workshop.html#frog-development-background", "title": "Independent Study to prepare for workshop", "section": "Frog development: background", - "text": "Frog development: background\n\n3 fertilisations = replicates\ntwo siblings from each fertilisation one control, on FGF treated\nsequenced at three time points: S14, S20, S30\n3 x 2 x 3 = 18 groups", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Frog development: background\n\n3 fertilisations = replicates\ntwo siblings from each fertilisation one control, on FGF treated\nsequenced at three time points: S14, S20, S30\n3 x 2 x 3 = 18 groups" }, { "objectID": "omics/week-3/study_before_workshop.html#frog-development-background-1", "href": "omics/week-3/study_before_workshop.html#frog-development-background-1", "title": "Independent Study to prepare for workshop", "section": "Frog development: background", - "text": "Frog development: background\n\nRaw data: GEO Series GSE81682\nIllumina HiSeq\nshort reads 150-300bp\nA single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation (Nestorowa et al. 2016)\n3,840 samples\nReads were aligned using G-SNAP and the mapped reads were assigned to Ensembl genes HTSeq\nGSE81682_HTSeq_counts.txt.gz (bottom of the page). And GSE81682_HTSeq_counts.txt.zip", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Frog development: background\n\nRaw data: GEO Series GSE81682\nIllumina HiSeq\nshort reads 150-300bp\nA single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation (Nestorowa et al. 2016)\n3,840 samples\nReads were aligned using G-SNAP and the mapped reads were assigned to Ensembl genes HTSeq\nGSE81682_HTSeq_counts.txt.gz (bottom of the page). And GSE81682_HTSeq_counts.txt.zip" }, { "objectID": "omics/week-3/study_before_workshop.html#frog-development-processing-so-far", "href": "omics/week-3/study_before_workshop.html#frog-development-processing-so-far", "title": "Independent Study to prepare for workshop", "section": "Frog development: Processing so far", - "text": "Frog development: Processing so far\n\nselection of cells\nselection of genes: subset of surfaceome\nlog2 normalised values", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Frog development: Processing so far\n\nselection of cells\nselection of genes: subset of surfaceome\nlog2 normalised values" }, { "objectID": "omics/week-3/study_before_workshop.html#frog-development-aims", "href": "omics/week-3/study_before_workshop.html#frog-development-aims", "title": "Independent Study to prepare for workshop", "section": "Frog development: Aims", - "text": "Frog development: Aims\n\nFind interesting cell surface molecule genes that vary between cell types.", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Frog development: Aims\n\nFind interesting cell surface molecule genes that vary between cell types." }, { "objectID": "omics/week-3/study_before_workshop.html#deliverables", "href": "omics/week-3/study_before_workshop.html#deliverables", "title": "Independent Study to prepare for workshop", "section": "Deliverables", - "text": "Deliverables\n\nDescribe the data\n\nNumber of cells/samples/reps/treatments\nnumber of genes\ntype of expression values\nprior processing\nmissing values\noverview of expression\nclustering of genes/samples\n\nReport on differential expression between two groups\n\nnumber of DE at 1%, 5% and 10%.\ntable of expression, fold changes, signifcance at each sig.\nVolcano plot\n\nReport list of marker candidate gene IDs for a cell type of choice. Justify filters. Table with fold FC, p values, IDs, canonical gene names\nInterpret the biology by reporting on a few group of genes and the processes in which they are involved.\nReport on your chosen genes and explain why you think they are good candidates for follow up work", - "crumbs": [ - "Week 3: Hello data!", - "Prepare!" - ] + "text": "Deliverables\n\nDescribe the data\n\nNumber of cells/samples/reps/treatments\nnumber of genes\ntype of expression values\nprior processing\nmissing values\noverview of expression\nclustering of genes/samples\n\nReport on differential expression between two groups\n\nnumber of DE at 1%, 5% and 10%.\ntable of expression, fold changes, signifcance at each sig.\nVolcano plot\n\nReport list of marker candidate gene IDs for a cell type of choice. Justify filters. Table with fold FC, p values, IDs, canonical gene names\nInterpret the biology by reporting on a few group of genes and the processes in which they are involved.\nReport on your chosen genes and explain why you think they are good candidates for follow up work" }, { "objectID": "core/week-2/workshop.html#omics", @@ -953,5 +909,124 @@ "Week 2: Workflow tips", "Workshop" ] + }, + { + "objectID": "omics/week-3/study_before_workshop.html#the-data", + "href": "omics/week-3/study_before_workshop.html#the-data", + "title": "Independent Study to prepare for workshop", + "section": "The Data", + "text": "The Data\nThere are three datasets\n\n🐸 transcriptomic data (bulk RNA-seq) from frog embryos.\n🐭 transcriptomic data (single cell RNA-seq) from stemcells\n🍂 ??????? Metabolomic / Metagenomic data from anaerobic digesters" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#experimental-design", + "href": "omics/week-3/study_before_workshop.html#experimental-design", + "title": "Independent Study to prepare for workshop", + "section": "🐸 Experimental design", + "text": "🐸 Experimental design\n\nSchematic of frog development experiment" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#experimental-design-1", + "href": "omics/week-3/study_before_workshop.html#experimental-design-1", + "title": "Independent Study to prepare for workshop", + "section": "🐸 Experimental design", + "text": "🐸 Experimental design\n\nSchematic of frog development experiment" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#experimental-design-2", + "href": "omics/week-3/study_before_workshop.html#experimental-design-2", + "title": "Independent Study to prepare for workshop", + "section": "🐸 Experimental design", + "text": "🐸 Experimental design\n\nSchematic of frog development experiment\n\n3 fertilisations\ntwo siblings from each fertilisation one control, on FGF treated\nsequenced at three time points\n3 x 2 x 3 = 18 groups" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#section", + "href": "omics/week-3/study_before_workshop.html#section", + "title": "Independent Study to prepare for workshop", + "section": "", + "text": "find important genes\nwhat is ‘important’\n\nexpressed\ndifferentially expressed i.e., which genes are significantly higher in one group vs another\n\n\nthe difference between HSPC and Prog cells the difference between the control and the FGF treated sibling at S30 ## Sequence Data\n\nThe raw data are “reads” from a sequencing machine. Reads are short sequences of DNA or RNA.\nThe reads are aligned to a reference genome or transcriptome. The reads are then counted to quantify the expression of each gene. The counts are normalised to allow comparison between samples.\nreads\nquality control\nalign/pseudoalign\nquantify\nnormalise" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#revise-pivot-longer", + "href": "omics/week-3/study_before_workshop.html#revise-pivot-longer", + "title": "Independent Study to prepare for workshop", + "section": "revise pivot longer", + "text": "revise pivot longer" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#revise-pivot-longer-1", + "href": "omics/week-3/study_before_workshop.html#revise-pivot-longer-1", + "title": "Independent Study to prepare for workshop", + "section": "revise pivot longer", + "text": "revise pivot longer\n\n\n🔗 About Omics 1: Hello data!\n\n\n\nNestorowa, Sonia, Fiona K. Hamey, Blanca Pijuan Sala, Evangelia Diamanti, Mairi Shepherd, Elisa Laurenti, Nicola K. Wilson, David G. Kent, and Berthold Göttgens. 2016. “A Single-Cell Resolution Map of Mouse Hematopoietic Stem and Progenitor Cell Differentiation.” Blood 128 (8): e20–31. https://doi.org/10.1182/blood-2016-05-716480." + }, + { + "objectID": "omics/week-3/study_before_workshop.html#nb-still-in-construction", + "href": "omics/week-3/study_before_workshop.html#nb-still-in-construction", + "title": "Independent Study to prepare for workshop", + "section": "🚧 NB still in construction 🚧", + "text": "🚧 NB still in construction 🚧" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#overview", + "href": "omics/week-3/study_before_workshop.html#overview", + "title": "Independent Study to prepare for workshop", + "section": "Overview", + "text": "Overview\n\n\nConcise summary of the experimental design and aims\nWhat the raw data consist of\nWhat has been done to the data so far\nWhat steps we will take in the workshop" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#experimental-design-3", + "href": "omics/week-3/study_before_workshop.html#experimental-design-3", + "title": "Independent Study to prepare for workshop", + "section": "🐸 Experimental design", + "text": "🐸 Experimental design\n\nSchematic of frog development experiment\n\n3 fertilisations. These are the replicates, .5, .6, A\ntwo siblings from each fertilisation one control, one FGF treated. The treatments are paired\nsequenced at three time points. S14, S20, S30\n3 x 2 x 3 = 18 groups" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#aim", + "href": "omics/week-3/study_before_workshop.html#aim", + "title": "Independent Study to prepare for workshop", + "section": "🐸 Aim", + "text": "🐸 Aim\n\n\nfind genes important in frog development\nImportant means genes that are differentially expressed between the control and the FGF treated sibling\nDifferentially expressed means the expression on one group is signifcantly higher than the other" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#guided-analysis", + "href": "omics/week-3/study_before_workshop.html#guided-analysis", + "title": "Independent Study to prepare for workshop", + "section": "🐸 Guided analysis", + "text": "🐸 Guided analysis\n\n\nThe workshops will take you through comparing the control and FGF treated sibling at S30\nThis is the “least interesting” comparison\nYou will be guided to carefully document your work so you can apply the same methods to other comparisons" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#experimental-design-4", + "href": "omics/week-3/study_before_workshop.html#experimental-design-4", + "title": "Independent Study to prepare for workshop", + "section": "🐭 Experimental design", + "text": "🐭 Experimental design\n\nSchematic of stem cell experiment" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#experimental-design-5", + "href": "omics/week-3/study_before_workshop.html#experimental-design-5", + "title": "Independent Study to prepare for workshop", + "section": "🐭 Experimental design", + "text": "🐭 Experimental design\n\nSchematic of stem cell experiment\n\nCells were sorted using flow cytometry on the basis of cell surface markers\nThere are three cell types: LT-HSCs, HSPCs, Progs\nMany cells of each cell type were sequenced" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#experimental-design-6", + "href": "omics/week-3/study_before_workshop.html#experimental-design-6", + "title": "Independent Study to prepare for workshop", + "section": "🐭 Experimental design", + "text": "🐭 Experimental design\n\nSchematic of stem cell experiment\n\nThere are three cell types: LT-HSCs, HSPCs, Progs These are the “treaments”\nMany cells of each cell type were sequenced: These are the replicates\n155 LT-HSCs, 701 HSPCs, 798 Progs" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#aim-1", + "href": "omics/week-3/study_before_workshop.html#aim-1", + "title": "Independent Study to prepare for workshop", + "section": "🐭 Aim", + "text": "🐭 Aim\n\n\nfind genes for cell surface proteins that are important in stem cell identity\nImportant means genes that are differentially expressed between at least two cell types\nDifferentially expressed means the expression on one group is significantly higher than the other" + }, + { + "objectID": "omics/week-3/study_before_workshop.html#guided-analysis-1", + "href": "omics/week-3/study_before_workshop.html#guided-analysis-1", + "title": "Independent Study to prepare for workshop", + "section": "🐭 Guided analysis", + "text": "🐭 Guided analysis\n\n\nThe workshops will take you through comparing the HSPC and Prog cells\nThis is the “least interesting” comparison\nYou will be guided to carefully document your work so you can apply the same methods to other comparisons\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n🔗 About Omics 1: Hello data!" } ] \ No newline at end of file diff --git a/omics/week-3/images/88H-exp-design-jillian.png b/omics/week-3/images/88H-exp-design-jillian.png new file mode 100644 index 0000000..e46a2ba Binary files /dev/null and b/omics/week-3/images/88H-exp-design-jillian.png differ diff --git a/omics/week-3/study_before_workshop.qmd b/omics/week-3/study_before_workshop.qmd index f63d162..fac0482 100644 --- a/omics/week-3/study_before_workshop.qmd +++ b/omics/week-3/study_before_workshop.qmd @@ -16,153 +16,277 @@ editor: --- +## 🚧 NB still in construction 🚧 -## Omics workshops -What the data are and something about what has been done to the raw data to get get. +## Overview -- 🐸 the difference between the control and the FGF treated sibling at - S30 -- 🐭 the difference between HSPC and Prog cells -- 🍂 ??????? -## Aims +::: incremental -- find important genes -- what is 'important' - - expressed - - differentially expressed i.e., which genes are significantly higher in one group vs another +- Concise summary of the experimental design and aims +- What the raw data consist of +- What has been done to the data so far +- What steps we will take in the workshop -## Sequence Data +::: -- The raw data are "reads" from a sequencing machine. Reads are short sequences of DNA or RNA. +## The Data +There are three datasets -- The reads are aligned to a reference genome or transcriptome. The reads are then counted to quantify the expression of each gene. The counts are normalised to allow comparison between samples. -- reads -- quality control -- align/pseudoalign -- quantify -- normalise +- 🐸 transcriptomic data (bulk RNA-seq) from frog embryos. +- 🐭 transcriptomic data (single cell RNA-seq) from stemcells -## What is a read +- 🍂 ??????? Metabolomic / Metagenomic data from anaerobic digesters -- FASTQ format files - - sequences and information about each sequence's read accuracy +# Experimental design -## Differential expression -- what is differentially expressed -- how are DE genes related -- what are DE genes involved with +## 🐸 Experimental design {auto-animate="true"} -## Stem cells: background +![Schematic of frog development +experiment](images/88H-exp-design-betsy.png){fig-align="center" +width="700"} -Cells were sorted using flow cytometry on the basis of cell surface markers. There are three cell types: +## 🐸 Experimental design {auto-animate="true"} -- long-term haematopoetic stem cells (LT-HSCs) +![Schematic of frog development +experiment](images/88H-exp-design-betsy.png){fig-align="left" +width="200"} -- haematopoetic stem and progenitor cells (HSPCs) +::: incremental +- 3 fertilisations -- progenitor cells (Progs) +- two siblings from each fertilisation one control, on FGF treated -Each cell is then sequenced to quantify all the transcripts in each cell. different transcripts (genes) were identified. +- sequenced at three time points +- 3 x 2 x 3 = 18 groups +::: -## Stem cells: background +## 🐸 Experimental design {auto-animate="true"} -- Raw data: [GEO Series GSE81682](https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE81682) -- Illumina HiSeq -- short reads 150-300bp -- [A single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5305050/) [@nestorowa2016] -- 3,840 samples -- Reads were aligned using G-SNAP and the mapped reads were assigned to Ensembl genes HTSeq -- GSE81682_HTSeq_counts.txt.gz (bottom of the page). And [GSE81682_HTSeq_counts.txt.zip](../data/jillian/GSE81682_HTSeq_counts.zip) +![Schematic of frog development +experiment](images/88H-exp-design-betsy.png){fig-align="left" +width="200"} -## Stem cells: Processing so far +::: incremental +- 3 fertilisations. [These are the replicates, `.5`, `.6`, + `A`]{style="color:#009900"} -- selection of cells -- selection of genes: subset of surfaceome -- log2 normalised values +- two siblings from each fertilisation one control, one FGF treated. + [The treatments are paired]{style="color:#009900"} -## Stem cells: Aims +- sequenced at three time points. [S14, S20, + S30]{style="color:#009900"} -- Find interesting **cell surface molecule genes** that vary between cell types. +- 3 x 2 x 3 = 18 groups +::: +## 🐸 Aim -## Frog development: background +::: incremental -- 3 fertilisations = replicates -- two siblings from each fertilisation one control, on FGF treated -- sequenced at three time points: S14, S20, S30 -- 3 x 2 x 3 = 18 groups +- find genes important in frog development + +- Important means genes that are differentially expressed between the control and the FGF treated sibling + +- Differentially expressed means the expression on one group is signifcantly higher than the other + +::: + + +## 🐸 Guided analysis + +::: incremental + +- The workshops will take you through comparing the control and FGF treated sibling at S30 + +- This is the "least interesting" comparison + +- You will be guided to carefully document your work so you can apply the same methods to other comparisons + +::: + + + + +## 🐭 Experimental design {auto-animate="true"} + +![Schematic of stem cell experiment](images/88H-exp-design-jillian.png){fig-align="center" +width="700"} + +## 🐭 Experimental design {auto-animate="true"} + +![Schematic of stem cell experiment](images/88H-exp-design-jillian.png){fig-align="left" +width="200"} + +::: incremental +- Cells were sorted using flow cytometry on the basis of cell surface +markers + +- There are three cell types: LT-HSCs, HSPCs, Progs + +- Many cells of each cell type were sequenced + +- +::: + +## 🐭 Experimental design {auto-animate="true"} + +![Schematic of stem cell experiment](images/88H-exp-design-jillian.png){fig-align="left" +width="200"} + +::: incremental + +- There are three cell types: LT-HSCs, HSPCs, Progs [These are the "treaments"]{style="color:#009900"} + + +- Many cells of each cell type were sequenced: [These are the replicates]{style="color:#009900"} + +- [155 LT-HSCs, 701 HSPCs, 798 Progs]{style="color:#009900"} + +::: + +## 🐭 Aim + +::: incremental + +- find genes for cell surface proteins that are important in stem cell identity + +- Important means genes that are differentially expressed between at least two cell types + +- Differentially expressed means the expression on one group is significantly higher than the other + +::: + + +## 🐭 Guided analysis + +::: incremental + +- The workshops will take you through comparing the HSPC and Prog cells + +- This is the "least interesting" comparison + +- You will be guided to carefully document your work so you can apply the same methods to other comparisons + +::: + + + + + + + + + + + + + + + + + + + + + + + + + + + + + -![](images/88H-exp-design-betsy.png) + + + -## Frog development: background + + -- Raw data: [GEO Series GSE81682](https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE81682) -- Illumina HiSeq -- short reads 150-300bp -- [A single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5305050/) [@nestorowa2016] -- 3,840 samples -- Reads were aligned using G-SNAP and the mapped reads were assigned to Ensembl genes HTSeq -- GSE81682_HTSeq_counts.txt.gz (bottom of the page). And [GSE81682_HTSeq_counts.txt.zip](../data/jillian/GSE81682_HTSeq_counts.zip) + - + + + + -## Frog development: Processing so far -- selection of cells -- selection of genes: subset of surfaceome -- log2 normalised values -## Frog development: Aims -- Find interesting **cell surface molecule genes** that vary between cell types. + + + + + + + + + + + + + + + + + + + + + -## + -## Deliverables + + + + + + + + -1. Describe the data - - Number of cells/samples/reps/treatments - - number of genes - - type of expression values - - prior processing - - missing values - - overview of expression - - clustering of genes/samples + -2. Report on differential expression between two groups - - number of DE at 1%, 5% and 10%. - - table of expression, fold changes, signifcance at each sig. - - Volcano plot + + + -3. Report list of marker candidate gene IDs for a cell type of choice. Justify filters. Table with fold FC, p values, IDs, canonical gene names + + + -4. Interpret the biology by reporting on a few group of genes and the processes in which they are involved. + + -5. Report on your chosen genes and explain why you think they are good candidates for follow up work + + + -## revise pivot longer -## revise pivot longer \ No newline at end of file + diff --git a/update-notes.txt b/update-notes.txt index 0417ddb..e8a7493 100644 --- a/update-notes.txt +++ b/update-notes.txt @@ -46,4 +46,16 @@ for workshop 2, make sure you examine which frog genes are 0 in all of one sampl which genes matters: first do venn diagram, present absent then to differential expressions - +- Raw data: [GEO Series + GSE81682](https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE81682) +- Illumina HiSeq +- short reads 150-300bp +- [A single-cell resolution map of mouse hematopoietic stem and + progenitor cell + differentiation](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5305050/) + [@nestorowa2016] +- 3,840 samples +- Reads were aligned using G-SNAP and the mapped reads were assigned + to Ensembl genes HTSeq +- GSE81682_HTSeq_counts.txt.gz (bottom of the page). And + [GSE81682_HTSeq_counts.txt.zip](../data/jillian/GSE81682_HTSeq_counts.zip)