-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #1 from 3mmaRand/draft-core-1
Draft core 1
- Loading branch information
Showing
10 changed files
with
137 additions
and
107 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,71 +1,53 @@ | ||
--- | ||
title: "Core Data Analysis for Group Project" | ||
title: "Core Data Analysis" | ||
toc: true | ||
toc-location: right | ||
--- | ||
|
||
# Content | ||
There are three workshops taken by everyone on BIO00088H and BIO00070M. These are in weeks 1, 2 and 6. The first two cover some useful workflow tips and how to organise your analyses effectively so they are reproducible but you will also have the chance to revise material from stage 1 and 2. | ||
|
||
Good organisation is important because you will want to be able to set work aside for holidays and assessment periods and then restart easily. You will also be assessed on the organisation, reproducibility and transparency of your work. | ||
|
||
## Week 1 Core 1 Organising Reproducible Data Analyses | ||
## Week 1 Core 1 Organising reproducible data analyses | ||
|
||
Note no R coding (too early for MSc BIN 70M who share the core and omics teaching) | ||
This week you will revise some essential concepts for scientific computing: file system organisation, file types, working directories and paths. The workshop will cover a rationale for working reproducibly, project oriented workflow, naming things and documenting your work. We will also examine some file types and the concept of tidy data. | ||
|
||
Before | ||
- Optional revision: What they forgot to teach you about computers: operating systems, file systems, file types, working directories and paths | ||
|
||
- Why reproducibility (BIO00058M-Data-science-2020/slides/03_repro_and_intro_to_rmarkdown) | ||
## Week 2 Core 2 Workflow tips | ||
|
||
Workshop | ||
<!-- Before --> | ||
<!-- Apply to GitHub Global Campus as a student --> | ||
|
||
- Project organisation: folders, files | ||
- Project oriented workflow | ||
- Naming things | ||
- File formats | ||
- Data management | ||
- google drive: https://www.york.ac.uk/it-services/services/drive/#tab-6 | ||
- documenting | ||
- organisation within files | ||
- Data files. Similarities and differences | ||
- Sequences data | ||
- Image data | ||
- Structure data. | ||
- Keeping a lab book | ||
- Readme | ||
- Reference managers: Zotero? | ||
<!-- - Reference managers: Zotero --> | ||
<!-- - github copilot --> | ||
<!-- - chat gtp --> | ||
<!-- - Data management --> | ||
<!-- - google drive: https://www.york.ac.uk/it-services/services/drive/#tab-6 --> | ||
|
||
After | ||
|
||
## Week 2 Core 2 NEEDS A TITLE | ||
<!-- Workshop --> | ||
|
||
Before | ||
Possibly: | ||
Code formatting and style, 😎 Cool code Tips, Code 'algorithmically.', Writing functions (R and python??) (BIO00058M-Data-science-2020/slides/03_repro_and_intro_to_rmarkdown) | ||
<!-- Code formatting and style, 😎 Cool code Tips, Code 'algorithmically.', Writing functions (R and python??) (BIO00058M-Data-science-2020/slides/03_repro_and_intro_to_rmarkdown) --> | ||
|
||
|
||
|
||
Workshop | ||
|
||
|
||
|
||
After | ||
<!-- After --> | ||
|
||
## Week 6 Core 3 Reproducible Reporting | ||
|
||
Before | ||
(BIO00058M-Data-science-2020/slides/03_repro_and_intro_to_rmarkdown and BIO00058M-Data-science-2020/slides/04_advanced_rmarkdown.html) | ||
Literate programming | ||
What is quarto | ||
markdown basics: text, code chunks, headings | ||
yaml | ||
automatic numbering of figures and tables | ||
cross references | ||
special characters | ||
citations | ||
<!-- Before --> | ||
<!-- (BIO00058M-Data-science-2020/slides/03_repro_and_intro_to_rmarkdown and BIO00058M-Data-science-2020/slides/04_advanced_rmarkdown.html) --> | ||
<!-- Literate programming --> | ||
<!-- What is quarto --> | ||
<!-- markdown basics: text, code chunks, headings --> | ||
<!-- yaml --> | ||
<!-- automatic numbering of figures and tables --> | ||
<!-- cross references --> | ||
<!-- special characters --> | ||
<!-- citations --> | ||
|
||
<!-- Workshop --> | ||
|
||
Workshop | ||
<!-- practice doing the above with your project and data --> | ||
|
||
practice doing the above with your project and data | ||
|
||
After |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,33 +1,32 @@ | ||
--- | ||
title: "Overview" | ||
subtitle: "Core 1: Organising Data Analyses" | ||
toc: true | ||
toc-location: right | ||
--- | ||
|
||
xxxxx | ||
This week you will revise some essential concepts for scientific computing: file system organisation, file types, working directories and paths. The workshop will cover a rationale for working reproducibly, project oriented workflow, naming things and documenting your work. We will also examine some file types and the concept of tidy data. | ||
|
||
|
||
### Learning objectives | ||
|
||
- dd | ||
- dd. | ||
- dd | ||
- d | ||
The successful student will be able to: | ||
|
||
- explain the organisation of files and directories in a file systems including root, home and working directories | ||
- explain absolute and relative file paths | ||
- explain why working reproducibly is important | ||
- know how to use a project-oriented workflow to organise work | ||
- be able to give files human- and machine-readable names | ||
- outline some common biological data file formats | ||
|
||
### Instructions | ||
|
||
1. [Prepare](study_before_workshop.qmd) | ||
|
||
i. 📖 Read [What they forgot to teach you about computers](https://3mmarand.github.io/comp4biosci/what_they_forgot.html) | ||
i. 📖 Read Understanding file systems | ||
|
||
2. [Workshop](workshop.qmd) | ||
|
||
i. 💻 dd. | ||
|
||
ii. 💻 ddd | ||
|
||
iii. 💻 ddd | ||
|
||
3. [Consolidate](study_after_workshop.qmd) | ||
|
||
i. 💻 dd | ||
|
||
ii. 💻 dd |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,25 +1,22 @@ | ||
--- | ||
title: "Independent Study to consolidate this week" | ||
subtitle: "Core 1" | ||
toc: true | ||
toc-location: right | ||
format: | ||
html: | ||
code-fold: true | ||
code-summary: "Answer - don't look until you have tried!" | ||
--- | ||
|
||
# Set up | ||
## BIO00088H Group Research Project students | ||
|
||
If you have just opened RStudio you will want to load the packages and import the data. | ||
1. Start to build the the file and folder infrastructure for your project | ||
- | ||
- | ||
- | ||
|
||
```{r} | ||
#| code-fold: false | ||
library(tidyverse) | ||
library(readxl) | ||
``` | ||
|
||
1. 💻 xx. | ||
## MSc Bioinformatics students doing BIO00070M | ||
|
||
```{r} | ||
``` | ||
1. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,8 @@ | ||
--- | ||
title: "Independent Study to prepare for workshop" | ||
subtitle: "Core 1" | ||
toc: true | ||
toc-location: right | ||
--- | ||
|
||
1. 📖 Read xxxx | ||
1. 📖 Read [Understanding file systems](https://3mmarand.github.io/comp4biosci/file_systems.html). This is an approximately 15 - 20 minute read revising file types and filesystems. It covers concepts of working directories and paths. We learned these ideas in stage 1 and you may feel completely confident with them but many students will benefit from a refresher. For BIO00070M students, this is part of the work you will also be asked to complete for BIO00052M Data Analysis in R. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters