Welcome to Introduction to Bioinformatics Analysis, a course taught by Phillip A Richmond.
The purpose of this course is to provide an introduction to Bioinformatics and Genomics as it pertains to short-read sequencing analysis, with a specific focus on brewing and fermentation yeast strains. Taught in an inverted classroom format, there will be screencasts and lecture notes for each section that can be gone through independently, and in class we will simply work on example datasets and problem sets.
The course is intended for academic researchers at Canadian institutions in Western Canada, that have access to nationally supported Research Computing through Compute Canada, specifically the WestGrid branch.
- Prerequisites
- WestGrid access is required, and is different for a PI vs. a student
- Westgrid Account information here
- For a Westgrid Account, register by following these instructions. Realize that the process can take several days!
- GitHub Account
- It's free, and you can sign up here
- For mac/linux users, only need native terminal which comes with the operating system
- For PC users, download and install MobaXterm
- IGV installed on local machine
- WestGrid access is required, and is different for a PI vs. a student
- General Course Info
- Videos are hosted via YouTube at Phil Richmond's YouTube Channel here
- Slides, Assignments, and Quizzes are hosted via Google Drive here
- Extra resources:
- Linux/Unix Cheatsheet
- Linux/Unix online tutorial
- Editor cheat sheets
- File transfer programs
- WestGrid resources
-
Monday August 15th, 2016.
2:00 PM - 4:00 PM.
Section I-1, Section I-2, Section I-3a -
Tuesday August 16th, 2016
9:30 AM - 11:30 AM
Section I-3a finish, Section I-3b -
Wednesday August 17th, 2016
9:30 AM - 11:30 AM
Section I-4 -
Thursday August 18th, 2016
9:30 AM - 11:30 AM
Section I-5, I-6 -
Friday August 19th, 2016
9:30 AM - 11:30 AM
Section I-7
- Introduction to Next Generation Sequencing, Bioinformatics, and Computing
- Getting set-up on WestGrid and using terminal
- Open up a terminal
- Customize terminal
- Login to WestGrid
- Specs of Orcinus
- Slides
- Video
- PC User MobaXterm guide
- In Class Assignment Section I-2
- Total in class time: 30 minutes
- Exploring Linux/Unix
- Intro to linux environment (3a)
- Intermediate Linux commands (3b)
- In Class Assignment Section I-3
- For more practice, do these exercises
- Advanced Linux/Unix
- File Management
- Compression
- Downloading (wget)
- Remote Copy (scp)
- File Editing
- vi, emacs, nano: Pick emacs
- Emacs cheet sheet
- vi cheat sheet
- nano cheat sheet
- Other Secure Copy programs
- Slides
- Video
- In Class Assignment
- Intro to Read Mapping and Data Visualization
- Bowtie2 & Samtools (5a)
- Data visualization (5b)
- In Class Assignment
- Organization and file naming
- Interacting with the queue
- Job Submission
- Job Tracking
- WestGrid Job Running Resource
- Slides
- Video
- In Class Assignment
The in-class test will be an individual examination, so you won't be able to work together in groups for it. The test will be comprehensive for all the things we learned in Section I, and if you are able to complete the assignments for each section, then the test should take only ~ 30 minutes. You will have a 2 hour block to complete the test. If you are unable to do so, then you won't be able to move on to Section II.
Section II: Introduction to Applied Genomics: Acquiring Data, Raw data QC, Variant Calling, Assembly, RNAseq
NOTE For section II, you will need to ask WestGrid for special access to the high memory servers: Hungabee, Breezy. Email: [email protected]
- Getting data from the SRA, Raw Data QC
- Short read archive & fastq.dump
- FastQC
- Trimmomatic
- Slides
- Video
- RNAseq (grape vine)
- TopHat
- Cufflinks --> Cuffmerge --> Cuffdiff
- Slides
- Video
- In Class Assignment
- Transposon Insertion Profiling (TIP-seq)
- Initial Processing (3a)
- Clean up reads
- Map with BWA
- Samtools Unique mapped
- Remove duplicates
- Visualize
- Slides
- Video
- Differential Insertion (3b)
- Call Peaks
- Differential Insertion Analysis
- Slides
- Video
- Transcriptome Assembly
- Trinity
- Trans-Abyss
- Slides
- Video
- In Class Assignment
- Variant Calling (Saccharomyces cerevisiae)
- Map with BWA
- Samtools mpileup
- Picard MarkDuplicates
- Samtools Unique Reads
- Visualize in IGV
- Slides
- Video
- In Class Assignment
This section is open to those who want to explore their own datasets and receive help with processing and data analysis.
Need to insert some testimonials here
Feedback? Suggestions? Don't hesitate to contact me:
Course Instructor | Affiliation | Email Address(es) | github ID | Phone Number |
---|---|---|---|---|
Phillip Richmond | PhD Student, Bioinformatics, UBC | [email protected] or [email protected] | @Phillip-a-Richmond | (604)655-3595 |