From cfa054187f8b9241542ba4bb338ee292bb071b15 Mon Sep 17 00:00:00 2001
From: 3mmaRand <7593411+3mmaRand@users.noreply.github.com>
Date: Thu, 28 Sep 2023 17:51:53 +0000
Subject: [PATCH] =?UTF-8?q?Deploying=20to=20gh-pages=20from=20@=203mmaRand?=
 =?UTF-8?q?/BIO00088H-data@4230fd0bf94cbb8019c647fcd2c6d551769d3654=20?=
 =?UTF-8?q?=F0=9F=9A=80?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 core/week-1/study_after_workshop.html |  9 ++----
 core/week-1/workshop.html             | 46 +++++++++++++++++----------
 core/week-2/workshop.html             |  9 +++++-
 search.json                           | 28 ++++++++++++----
 4 files changed, 61 insertions(+), 31 deletions(-)
diff --git a/core/week-1/study_after_workshop.html b/core/week-1/study_after_workshop.html
index d51533c..a6bd32b 100644
--- a/core/week-1/study_after_workshop.html
+++ b/core/week-1/study_after_workshop.html
@@ -270,20 +270,17 @@ <h1 class="title">Independent Study to consolidate this week</h1>
 
 </header>
 
+<p>These are suggestions</p>
 <section id="bio00088h-group-research-project-students" class="level2">
 <h2 class="anchored" data-anchor-id="bio00088h-group-research-project-students">BIO00088H Group Research Project students</h2>
 <ol type="1">
-<li><h2 id="start-to-build-the-the-file-and-folder-infrastructure-for-your-project" class="anchored">Start to build the the file and folder infrastructure for your project</h2>
-<ul>
-<li></li>
-<li></li>
-</ul></li>
+<li>Revise previous Data Analysis materials. You can find the version you took on the VLE site for 17C or 08C. However, my latest versions (in development) are here: <a href="https://3mmarand.github.io/R4BABS/">Data Analysis in R</a>. The Becoming a Bioscientist (BABS) modules replace the Laboratory and Professional Skills modules. BABS1 and BABS2 are stage one, and I’ve tried to improve them over 17C and 08C. The site is also searchable (icon top right).</li>
 </ol>
 </section>
 <section id="msc-bioinformatics-students-doing-bio00070m" class="level2">
 <h2 class="anchored" data-anchor-id="msc-bioinformatics-students-doing-bio00070m">MSc Bioinformatics students doing BIO00070M</h2>
 <ol type="1">
-<li></li>
+<li>Make sure you carry out the <a href="https://3mmarand.github.io/R4BABS/pgt52m/week-2/overview.html">preparatory work for week 2 of 52M</a></li>
 </ol>
 
 
diff --git a/core/week-1/workshop.html b/core/week-1/workshop.html
index c35cb8b..4086c24 100644
--- a/core/week-1/workshop.html
+++ b/core/week-1/workshop.html
@@ -288,6 +288,7 @@ <h2 id="toc-title">On this page</h2>
   <li><a href="#code-comments" id="toc-code-comments" class="nav-link" data-scroll-target="#code-comments">Code comments</a></li>
   </ul></li>
   <li><a href="#github-co-pilot-demo" id="toc-github-co-pilot-demo" class="nav-link" data-scroll-target="#github-co-pilot-demo">Github co-pilot demo</a></li>
+  <li><a href="#quarto-demo" id="toc-quarto-demo" class="nav-link" data-scroll-target="#quarto-demo">Quarto demo</a></li>
   <li><a href="#useful-exercises" id="toc-useful-exercises" class="nav-link" data-scroll-target="#useful-exercises">Useful exercises</a></li>
   <li><a href="#well-done" id="toc-well-done" class="nav-link" data-scroll-target="#well-done">🥳 Well Done! 🎉</a></li>
   <li><a href="#independent-study-following-the-workshop" id="toc-independent-study-following-the-workshop" class="nav-link" data-scroll-target="#independent-study-following-the-workshop">Independent study following the workshop</a></li>
@@ -355,9 +356,9 @@ <h2 class="anchored" data-anchor-id="why-does-it-matter">Why does it matter?</h2
 </figure>
 </div>
 <ul>
-<li><p>Five selfish reasons to work reproducibly. Alternatively, see the very entertaining <a href="https://youtu.be/yVT07Sukv9Q">talk</a></p></li>
+<li><p>Five selfish reasons to work reproducibly <span class="citation" data-cites="markowetz2015">(<a href="#ref-markowetz2015" role="doc-biblioref">Markowetz 2015</a>)</span>. Alternatively, see the very entertaining <a href="https://youtu.be/yVT07Sukv9Q">talk</a></p></li>
 <li><p>Many high profile cases of work which did not reproduce e.g.&nbsp;Anil Potti unravelled by <span class="citation" data-cites="baggerly2009">Baggerly and Coombes (<a href="#ref-baggerly2009" role="doc-biblioref">2009</a>)</span></p></li>
-<li><p><strong>Will</strong> become standard in Science and publishing e.g OECD Global Science Forum Building digital workforce capacity and skills for data-intensive science <span class="citation" data-cites="oecdglobalscienceforum2020">OECD Global Science Forum (<a href="#ref-oecdglobalscienceforum2020" role="doc-biblioref">2020</a>)</span></p></li>
+<li><p><strong>Will</strong> become standard in Science and publishing e.g OECD Global Science Forum Building digital workforce capacity and skills for data-intensive science <span class="citation" data-cites="oecdglobalscienceforum2020">(<a href="#ref-oecdglobalscienceforum2020" role="doc-biblioref">OECD Global Science Forum 2020</a>)</span></p></li>
 </ul>
 </section>
 <section id="how-to-achieve-reproducibility" class="level2">
@@ -388,14 +389,15 @@ <h2 class="anchored" data-anchor-id="project-oriented-workflow">Project-oriented
 <ul>
 <li><p>use folders to organise your work</p></li>
 <li><p>you are aiming for structured, systematic and repeatable.</p></li>
+<li><p>inputs and outputs should be clearly identifiable from structure and/or naming</p></li>
 </ul>
-<p>Example</p>
+<p>Examples</p>
 <pre><code>-- liver_transcriptome/
    |__data
       |__raw/
       |__processed/
    |__images/
-   |__R/
+   |__code/
    |__reports/
    |__figures/</code></pre>
 </section>
@@ -407,7 +409,12 @@ <h2 class="anchored" data-anchor-id="naming-things">Naming things</h2>
 <figcaption class="figure-caption">documents, CC-BY-NC, https://xkcd.com/1459/</figcaption>
 </figure>
 </div>
-<p>Guiding principle - names of files and directories should be systematic and readable by humans and machines. Have a convention!</p>
+<p>Guiding principle - Have a convention! Good file names are:</p>
+<ul>
+<li><p>machine readable</p></li>
+<li><p>human readable</p></li>
+<li><p>play nicely with sorting</p></li>
+</ul>
 <p>I suggest</p>
 <ul>
 <li><p>no spaces in names</p></li>
@@ -451,26 +458,21 @@ <h1>Documentation</h1>
 <h2 class="anchored" data-anchor-id="readme-files">Readme files</h2>
 <p>READMEs are a form of documentation which have been widely used for a long time. They contain all the information about the other files in a directory. They can be extensive but need not be. Concise is good. Bullet points are good</p>
 <ul>
-<li><p>Give a project description, brief</p></li>
+<li><p>Give a brief project title and description</p></li>
+<li><p>Give the start date, last-updated date and contact information</p></li>
 <li><p>Outline the folder structure</p></li>
 <li><p>Give software requirements: programs and versions used or required. There are packages that give session information in R <span class="citation" data-cites="sessioninfo">Wickham et al. (<a href="#ref-sessioninfo" role="doc-biblioref">2021</a>)</span> and Python <span class="citation" data-cites="ostblomjoel2019">Ostblom, Joel (<a href="#ref-ostblomjoel2019" role="doc-biblioref">2019</a>)</span></p></li>
 </ul>
 <p>R:</p>
-<pre><code>```
-#| eval: false
-sessioninfo::session_info()
-```</code></pre>
+<p><code>sessioninfo::session_info()</code></p>
 <p>Python:</p>
-<pre><code>```
-#| eval: false
-import session_info
-session_info.show()
-
-```</code></pre>
+<p><code>import session_info</code></p>
+<p><code>session_info.show()</code></p>
 <ul>
 <li><p>Instructions run the code, build reports, and reproduce the figures etc</p></li>
 <li><p>Where to find the data, outputs</p></li>
 <li><p>Any other information that needed to understand and recreate the work</p></li>
+<li><p>Ideally, a summary of changes with the date</p></li>
 </ul>
 <pre><code>-- liver_transcriptome/
    |__data
@@ -510,6 +512,9 @@ <h2 class="anchored" data-anchor-id="code-comments">Code comments</h2>
 <section id="github-co-pilot-demo" class="level1">
 <h1>Github co-pilot demo</h1>
 </section>
+<section id="quarto-demo" class="level1">
+<h1>Quarto demo</h1>
+</section>
 <section id="useful-exercises" class="level1">
 <h1>Useful exercises</h1>
 <ul>
@@ -520,11 +525,15 @@ <h1>Useful exercises</h1>
 <p>🎬 <a href="">Update R</a></p>
 <p>🎬 <a href="https://posit.co/download/rstudio-desktop/">Update RStudio</a>. You will need the prelease <a href="https://dailies.rstudio.com/rstudio/desert-sunflower/">Dessert Sunflower</a> for github Copilot integration</p></li>
 <li><p>Install package building tools</p>
-<p>🎬 Install Rtools (windows) or Xcode (mac)</p></li>
+<p>🎬 Windows: install <a href="https://cran.r-project.org/bin/windows/Rtools/rtools43/rtools.html">Rtools</a></p>
+<p>🎬 Mac: install <a href="https://apps.apple.com/ca/app/xcode/id497799835?mt=12">Xcode from the Mac App Store</a></p></li>
 <li><p>Update packages:</p>
 <p>🎬 devtools, tidyverse, BiocManager, readxl</p></li>
 <li><p>Install Quarto</p>
 <p>🎬 <a href="https://quarto.org">Install Quarto</a></p></li>
+<li><p>Install Zotero</p>
+<p>🎬 Install <a href="https://www.zotero.org/">Zotero</a></p>
+<p>🎬 <a href="https://www.zotero.org/user/register">Sign up for an account</a></p></li>
 </ul>
 <p>You’re finished!</p>
 </section>
@@ -550,6 +559,9 @@ <h1>Independent study following the workshop</h1>
 <div id="ref-baggerly2009" class="csl-entry" role="listitem">
 Baggerly, Keith A, and Kevin R Coombes. 2009. <span>“DERIVING CHEMOSENSITIVITY FROM CELL LINES: FORENSIC BIOINFORMATICS AND REPRODUCIBLE RESEARCH IN HIGH-THROUGHPUT BIOLOGY.”</span> <em>Ann. Appl. Stat.</em> 3 (4): 1309–34. <a href="https://doi.org/10.2307/27801549">https://doi.org/10.2307/27801549</a>.
 </div>
+<div id="ref-markowetz2015" class="csl-entry" role="listitem">
+Markowetz, Florian. 2015. <span>“Five Selfish Reasons to Work Reproducibly.”</span> <em>Genome Biol.</em> 16 (December): 274. <a href="https://doi.org/10.1186/s13059-015-0850-7">https://doi.org/10.1186/s13059-015-0850-7</a>.
+</div>
 <div id="ref-nationalacademiesofsciences2019" class="csl-entry" role="listitem">
 National Academies of Sciences, Engineering, Medicine, Policy, Global Affairs, Engineering, Medicine Committee on Science, Public Policy, Board on Research Data, et al. 2019. <em>Understanding Reproducibility and Replicability</em>. National Academies Press (US). <a href="https://www.ncbi.nlm.nih.gov/books/NBK547546/">https://www.ncbi.nlm.nih.gov/books/NBK547546/</a>.
 </div>
diff --git a/core/week-2/workshop.html b/core/week-2/workshop.html
index da9ab37..c8cfcee 100644
--- a/core/week-2/workshop.html
+++ b/core/week-2/workshop.html
@@ -283,9 +283,16 @@ <h1 class="title">Workshop</h1>
 <section id="session-overview" class="level2"><h2 class="anchored" data-anchor-id="session-overview">Session overview</h2>
 <p>In this workshop you will</p>
 </section><section id="file-formats" class="level2"><h2 class="anchored" data-anchor-id="file-formats">File formats</h2>
-<p>Data files. - Sequences data - Image data - Structure data</p>
+<p>Data files. - Sequences data - Image data - Structure data PDB/mmCIF www.pdb.org</p>
 <p>Similarities and differences</p>
 <p>🎬</p>
+<p>what is markdown</p>
+<p>Google Colab</p>
+<p>snippets</p>
+<p>python</p>
+<p>differences between r and python</p>
+<p>rstudio terminal</p>
+<p>basic bash</p>
 <p>You’re finished!</p>
 </section></section><section id="well-done" class="level1"><h1>🥳 Well Done! 🎉</h1>
 </section><section id="independent-study-following-the-workshop" class="level1"><h1>Independent study following the workshop</h1>
diff --git a/search.json b/search.json
index e2263f7..2e850f2 100644
--- a/search.json
+++ b/search.json
@@ -6,12 +6,26 @@
     "section": "",
     "text": "About this site"
   },
+  {
+    "objectID": "core/week-1/study_after_workshop.html",
+    "href": "core/week-1/study_after_workshop.html",
+    "title": "Independent Study to consolidate this week",
+    "section": "",
+    "text": "These are suggestions"
+  },
+  {
+    "objectID": "core/week-1/study_after_workshop.html#bio00088h-group-research-project-students",
+    "href": "core/week-1/study_after_workshop.html#bio00088h-group-research-project-students",
+    "title": "Independent Study to consolidate this week",
+    "section": "BIO00088H Group Research Project students",
+    "text": "BIO00088H Group Research Project students\n\nRevise previous Data Analysis materials. You can find the version you took on the VLE site for 17C or 08C. However, my latest versions (in development) are here: Data Analysis in R. The Becoming a Bioscientist (BABS) modules replace the Laboratory and Professional Skills modules. BABS1 and BABS2 are stage one, and I’ve tried to improve them over 17C and 08C. The site is also searchable (icon top right)."
+  },
   {
     "objectID": "core/week-1/study_after_workshop.html#msc-bioinformatics-students-doing-bio00070m",
     "href": "core/week-1/study_after_workshop.html#msc-bioinformatics-students-doing-bio00070m",
     "title": "Independent Study to consolidate this week",
     "section": "MSc Bioinformatics students doing BIO00070M",
-    "text": "MSc Bioinformatics students doing BIO00070M"
+    "text": "MSc Bioinformatics students doing BIO00070M\n\nMake sure you carry out the preparatory work for week 2 of 52M"
   },
   {
     "objectID": "core/week-1/workshop.html",
@@ -39,7 +53,7 @@
     "href": "core/week-1/workshop.html#why-does-it-matter",
     "title": "Workshop",
     "section": "Why does it matter?",
-    "text": "Why does it matter?\n\n\n\nfutureself, CC-BY-NC, by Julen Colomb\n\n\n\nFive selfish reasons to work reproducibly. Alternatively, see the very entertaining talk\nMany high profile cases of work which did not reproduce e.g. Anil Potti unravelled by Baggerly and Coombes (2009)\nWill become standard in Science and publishing e.g OECD Global Science Forum Building digital workforce capacity and skills for data-intensive science OECD Global Science Forum (2020)"
+    "text": "Why does it matter?\n\n\n\nfutureself, CC-BY-NC, by Julen Colomb\n\n\n\nFive selfish reasons to work reproducibly (Markowetz 2015). Alternatively, see the very entertaining talk\nMany high profile cases of work which did not reproduce e.g. Anil Potti unravelled by Baggerly and Coombes (2009)\nWill become standard in Science and publishing e.g OECD Global Science Forum Building digital workforce capacity and skills for data-intensive science (OECD Global Science Forum 2020)"
   },
   {
     "objectID": "core/week-1/workshop.html#how-to-achieve-reproducibility",
@@ -60,21 +74,21 @@
     "href": "core/week-1/workshop.html#project-oriented-workflow",
     "title": "Workshop",
     "section": "Project-oriented workflow",
-    "text": "Project-oriented workflow\n\nuse folders to organise your work\nyou are aiming for structured, systematic and repeatable.\n\nExample\n-- liver_transcriptome/\n   |__data\n      |__raw/\n      |__processed/\n   |__images/\n   |__R/\n   |__reports/\n   |__figures/"
+    "text": "Project-oriented workflow\n\nuse folders to organise your work\nyou are aiming for structured, systematic and repeatable.\ninputs and outputs should be clearly identifiable from structure and/or naming\n\nExamples\n-- liver_transcriptome/\n   |__data\n      |__raw/\n      |__processed/\n   |__images/\n   |__code/\n   |__reports/\n   |__figures/"
   },
   {
     "objectID": "core/week-1/workshop.html#naming-things",
     "href": "core/week-1/workshop.html#naming-things",
     "title": "Workshop",
     "section": "Naming things",
-    "text": "Naming things\n\n\n\ndocuments, CC-BY-NC, https://xkcd.com/1459/\n\n\nGuiding principle - names of files and directories should be systematic and readable by humans and machines. Have a convention!\nI suggest\n\nno spaces in names\nuse snake_case or kebab-case rather than CamelCase or dot.case\nuse all lower case except very occasionally where convention is otherwise, e.g., README, LICENSE\nordering: use left-padded numbers e.g., 01, 02….99 or 001, 002….999\ndates ISO 8601 format: 2020-10-16\nwrite down your conventions\n\n-- liver_transcriptome/\n   |__data\n      |__raw/\n         |__2022-03-21_donor_1.csv\n         |__2022-03-21_donor_2.csv\n         |__2022-03-21_donor_3.csv\n         |__2022-05-14_donor_1.csv\n         |__2022-05-14_donor_2.csv\n         |__2022-05-14_donor_3.csv\n      |__processed/\n   |__images/\n   |__code/\n      |__functions/\n         |__summarise.R\n         |__normalise.R\n         |__theme_volcano.R\n      |__01_data_processing.py\n      |__02_exploratory.R\n      |__03_modelling.R\n      |__04_figures.R\n   |__reports/\n      |__01_report.qmd\n      |__02_supplementary.qmd\n   |__figures/\n      |__01_volcano_donor_1_vs_donor_2.eps\n      |__02_volcano_donor_1_vs_donor_3.eps"
+    "text": "Naming things\n\n\n\ndocuments, CC-BY-NC, https://xkcd.com/1459/\n\n\nGuiding principle - Have a convention! Good file names are:\n\nmachine readable\nhuman readable\nplay nicely with sorting\n\nI suggest\n\nno spaces in names\nuse snake_case or kebab-case rather than CamelCase or dot.case\nuse all lower case except very occasionally where convention is otherwise, e.g., README, LICENSE\nordering: use left-padded numbers e.g., 01, 02….99 or 001, 002….999\ndates ISO 8601 format: 2020-10-16\nwrite down your conventions\n\n-- liver_transcriptome/\n   |__data\n      |__raw/\n         |__2022-03-21_donor_1.csv\n         |__2022-03-21_donor_2.csv\n         |__2022-03-21_donor_3.csv\n         |__2022-05-14_donor_1.csv\n         |__2022-05-14_donor_2.csv\n         |__2022-05-14_donor_3.csv\n      |__processed/\n   |__images/\n   |__code/\n      |__functions/\n         |__summarise.R\n         |__normalise.R\n         |__theme_volcano.R\n      |__01_data_processing.py\n      |__02_exploratory.R\n      |__03_modelling.R\n      |__04_figures.R\n   |__reports/\n      |__01_report.qmd\n      |__02_supplementary.qmd\n   |__figures/\n      |__01_volcano_donor_1_vs_donor_2.eps\n      |__02_volcano_donor_1_vs_donor_3.eps"
   },
   {
     "objectID": "core/week-1/workshop.html#readme-files",
     "href": "core/week-1/workshop.html#readme-files",
     "title": "Workshop",
     "section": "Readme files",
-    "text": "Readme files\nREADMEs are a form of documentation which have been widely used for a long time. They contain all the information about the other files in a directory. They can be extensive but need not be. Concise is good. Bullet points are good\n\nGive a project description, brief\nOutline the folder structure\nGive software requirements: programs and versions used or required. There are packages that give session information in R Wickham et al. (2021) and Python Ostblom, Joel (2019)\n\nR:\n```\n#| eval: false\nsessioninfo::session_info()\n```\nPython:\n```\n#| eval: false\nimport session_info\nsession_info.show()\n\n```\n\nInstructions run the code, build reports, and reproduce the figures etc\nWhere to find the data, outputs\nAny other information that needed to understand and recreate the work\n\n-- liver_transcriptome/\n   |__data\n      |__raw/\n         |__2022-03-21_donor_1.csv\n         |__2022-03-21_donor_2.csv\n         |__2022-03-21_donor_3.csv\n         |__2022-05-14_donor_1.csv\n         |__2022-05-14_donor_2.csv\n         |__2022-05-14_donor_3.csv\n      |__processed/\n   |__images/\n   |__code/\n      |__functions/\n         |__summarise.R\n         |__normalise.R\n         |__theme_volcano.R\n      |__01_data_processing.py\n      |__02_exploratory.R\n      |__03_modelling.R\n      |__04_figures.R\n   |__README.md\n   |__reports/\n      |__01_report.qmd\n      |__02_supplementary.qmd\n   |__figures/\n      |__01_volcano_donor_1_vs_donor_2.eps\n      |__02_volcano_donor_1_vs_donor_3.eps"
+    "text": "Readme files\nREADMEs are a form of documentation which have been widely used for a long time. They contain all the information about the other files in a directory. They can be extensive but need not be. Concise is good. Bullet points are good\n\nGive a brief project title and description\nGive the start date, last-updated date and contact information\nOutline the folder structure\nGive software requirements: programs and versions used or required. There are packages that give session information in R Wickham et al. (2021) and Python Ostblom, Joel (2019)\n\nR:\nsessioninfo::session_info()\nPython:\nimport session_info\nsession_info.show()\n\nInstructions run the code, build reports, and reproduce the figures etc\nWhere to find the data, outputs\nAny other information that needed to understand and recreate the work\nIdeally, a summary of changes with the date\n\n-- liver_transcriptome/\n   |__data\n      |__raw/\n         |__2022-03-21_donor_1.csv\n         |__2022-03-21_donor_2.csv\n         |__2022-03-21_donor_3.csv\n         |__2022-05-14_donor_1.csv\n         |__2022-05-14_donor_2.csv\n         |__2022-05-14_donor_3.csv\n      |__processed/\n   |__images/\n   |__code/\n      |__functions/\n         |__summarise.R\n         |__normalise.R\n         |__theme_volcano.R\n      |__01_data_processing.py\n      |__02_exploratory.R\n      |__03_modelling.R\n      |__04_figures.R\n   |__README.md\n   |__reports/\n      |__01_report.qmd\n      |__02_supplementary.qmd\n   |__figures/\n      |__01_volcano_donor_1_vs_donor_2.eps\n      |__02_volcano_donor_1_vs_donor_3.eps"
   },
   {
     "objectID": "core/week-1/workshop.html#code-comments",
@@ -95,7 +109,7 @@
     "href": "core/week-2/workshop.html",
     "title": "Workshop",
     "section": "",
-    "text": "In this workshop you will\n\nData files. - Sequences data - Image data - Structure data\nSimilarities and differences\n🎬\nYou’re finished!"
+    "text": "In this workshop you will\n\nData files. - Sequences data - Image data - Structure data PDB/mmCIF www.pdb.org\nSimilarities and differences\n🎬\nwhat is markdown\nGoogle Colab\nsnippets\npython\ndifferences between r and python\nrstudio terminal\nbasic bash\nYou’re finished!"
   },
   {
     "objectID": "core/week-2/workshop.html#session-overview",
@@ -109,7 +123,7 @@
     "href": "core/week-2/workshop.html#file-formats",
     "title": "Workshop",
     "section": "",
-    "text": "Data files. - Sequences data - Image data - Structure data\nSimilarities and differences\n🎬\nYou’re finished!"
+    "text": "Data files. - Sequences data - Image data - Structure data PDB/mmCIF www.pdb.org\nSimilarities and differences\n🎬\nwhat is markdown\nGoogle Colab\nsnippets\npython\ndifferences between r and python\nrstudio terminal\nbasic bash\nYou’re finished!"
   },
   {
     "objectID": "core/week-6/study_after_workshop.html",